AIDS, malaria and tuberculosis are surging


Efforts to defeat the coronavirus have fuelled 
a rise in other infectious diseases. Urgent
action is needed to avert a catastrophe. 


AIDS, malaria and tuberculosis (TB), three of
the deadliest infectious diseases, together kill
2.4 million people every year, with TB alone
responsible for 1.5 million deaths. And deaths
from these diseases could almost double over
the next year, according to the Global Fund to Fight AIDS, 
Tuberculosis and Malaria, a consortium of donors that 
funds treatments. The reason: coronavirus. It’s a horrifying 
prospect, and calls for an urgent action plan. 

More than three months of lockdowns have prevented 
many people from accessing treatments for non-COVID 
infectious diseases; at the same time, new cases of these 
illnesses will have gone undetected. Although lockdowns 
are easing, it will take some time for health care to get back 
to normal, as authorities continue to prioritize COVID-19. 
Taken together, this is resulting in a surge of cases. 

That’s why there needs to be a step change in funding for
AIDS, malaria and TB prevention, treatment and research, 
and greater public awareness of the rising threat posed by 
infectious diseases. And researchers — particularly epide- 
miologists — must continue to refine the models that are 
alerting the world to this approaching catastrophe. 

One model, developed by researchers at the London 
School of Hygiene and Tropical Medicine, projects that 
there will be around 200,000 extra deaths from TB across 
China, India and South Africa between 2020 and 2024 
(C. F. McQuaid et al. Eur. Respir. J. http://doi.org/d6ck; 
2020). The data for AIDS and malaria are just as troubling. 
In 2018, nearly half a million people in sub-Saharan Africa 
died from AIDS-related illnesses. If access to antiretroviral
therapies is disrupted for just six months, this number is
projected to double in the coming year, reaching a high not
seen in more than a decade, according to modelling by the
WHO and the Joint United Nations Programme on HIV/AIDS
(UNAIDS) published in May.

A study by researchers at Nigeria’s National Malaria Elimination
Programme in Abuja and Imperial College London,
published in Nature Medicine, predicts that 779,000 people 
are at risk of dying from malaria in sub-Saharan Africa in 
2020 — more than double the number for 2019 (E. Sherrard- 
Smith et al. Nature Med. http://doi.org/d6cn; 2020). 
Researchers at the WHO reached a similar estimate. They 
modelled the impact of COVID-19 on malaria in 41 countries 
under 9 scenarios. The worst case — a continuing suspension
of campaigns to distribute insecticide-treated bed
nets, and a 75% drop in access to antimalarial drugs — would
lead to 769,000 malaria deaths in sub-Saharan Africa this 
year, a mortality level not seen in 20 years. 

Right now, with the total number of COVID-19 infections 
approaching 20 million and deaths standing at more than 
700,000, we can’t say how bad the pandemic will get. 
But we can say that, without interventions, TB, AIDS and 
malaria are likely to take more lives. 


Four proposals 


Several things must now happen. First, hospitals and health 
authorities in affected cities and regions must recognize 
that AIDS, malaria and TB are surging again. In the case of 
TB, case detection — which has been affected by hospi- 
tal testing facilities being diverted for COVID-19 — needs 
to be resumed quickly. It is possible for testing facilities 
to be shared for the two diseases. Some hospitals in the 
Asia-Pacific region are using the same equipment to run 
COVID-19 tests in the morning and TB tests in the afternoon
— or vice versa. It is also possible to coordinate COVID-19 
testing with rapid diagnostic testing for HIV and malaria. 

Second, researchers must keep refining their models 
using more real-world data. 

Third, there is a need for public-information campaigns.
Public, private and non-governmental organizations must 
alert people to the risks of rising levels of infectious dis- 
eases. Such campaigns will also go some way towards 
reassuring existing patients, as well as those who become 
unwell, that they need to seek — or continue — treatment. 

Fourth, these campaigns cannot on their own keep 
surgeries and wards open, or equipment functioning. The 
resurgence of infectious diseases has created a greater 
demand for tests, treatments and research. All of these need 
more funding. In a June report (go.nature.com/3aez6jd),
the Global Fund calculated that an extra US$28.5 billion is
needed to ensure that HIV, TB and malaria programmes can 
continue to function, and that researchers can continue to 
develop common diagnostic tools — especially for TB and 
COVID-19. And this is just for the next 12 months. 

The Global Fund is confident it can access $6 billion of 
this sum, on top of its annual spending, but it cannot raise
the rest alone. Some of the fund’s largest international 
donors, such as the United Kingdom, are cutting back on 
science-aid funding. 

If the usual channels for fundraising seem unlikely to 
deliver, alternative approaches should be tried, such 
as public events at which governments, companies and 
philanthropic organizations are invited to make funding 
pledges. The governments of richer countries do donate 
at such events, as seen in May, when a live COVID-19 donor
conference organized by the European Union received 
€6.15 billion (US$7.2 billion) in pledges. 

COVID-19 has turned the clock back years, if not 
decades, in the fight against infectious diseases. It is, of 
course, imperative that all measures possible are taken to 
protect people from coronavirus and to treat those who 
have become sick. But saving people from one infectious 
disease only to have them die of another is the last thing 
anyone wants. 
A personal take on science and society


World view 


By Nisreen A. 
Alwan 


A negative COVID-19 test
does not mean recovery 


Pandemic policy must include defining and 
measuring what we mean by mild infection. 


Eight months into the global pandemic, we’re
still measuring its effects only in deaths.
Non-hospitalized cases are loosely termed ‘mild’
and are not followed up. Recovery is implied by
discharge from hospital or testing negative for
the virus. Ill health in those classed as ‘recovered’ is going
largely unmeasured. And, worldwide, millions of those still 
alive who got ill without being tested or hospitalized are 
simply not being counted. 

Previously healthy people with persistent symptoms 
such as chest heaviness, breathlessness, muscle pains, 
palpitations and fatigue, which prevent them from resum- 
ing work or physical or caring activities, are still classed 
under the umbrella of ‘mild COVID’. Data from a UK smart- 
phone app for tracking symptoms suggests that at least one 
in ten of those reporting are ill for more than three weeks. 
Symptoms lasting several weeks and impairing a person’s 
usual function should not be called mild. 

Defining and measuring recovery from COVID-19 should 
be more sophisticated than checking for hospital dis- 
charge, or testing negative for active infection or positive 
for antibodies. Once recovery is defined, we can differenti- 
ate COVID that quickly goes away from the prolonged form. 

I had COVID symptoms of fever, cough, gastrointestinal
upset, chest and leg pains in late March. But at that time,
non-hospitalized patients were not tested. Since then, I
have had bad days with some symptoms, then OK days, then 
worse days of exhaustion, making me regret what I did on 
the OK days, such as taking a short walk. 

This is a difficult time for me as a public-health academic
engaged in pandemic action while struggling with this
strange pattern of illness. I don’t know what it means for my
long-term health, which is concerning as a mother caring 
for young children. 

One consolation is knowing that I am not alone. There are
many others who have not regained their previous health, 
even months after their initial symptoms. Among them, 
fluctuating symptoms like mine are common. 

Although clinicians and researchers have an idea of who 
is at increased risk of dying from COVID, we don’t know who 
is more likely to experience prolonged ill health following 
symptomatic, or even asymptomatic, infection. The idea 
of accepting certain levels of infection to run through soci- 
ety, while protecting the vulnerable, becomes meaningless 
without considering health and productivity as outcomes 
alongside death. 

Research that follows COVID patients after discharge 
from hospital is starting. But there is still a gap in 
quantifying and characterizing COVID-related illness in
those not hospitalized. The consequences of failing to 
do so are significant. Some people, especially the young 
and healthy, might not see a need to follow preventive 
measures, because they expect only a few days of flu-like 
symptoms at the worst. Sick people might not get the 
support they need, and the true human and economic costs 
of the pandemic will not be correctly estimated. 

As long as ‘long COVID’ is labelled as anecdotal, it will 
not be taken seriously, and public communication will 
neglect it. We need to quantify it properly and accurately. 
We must measure recovery in those not presenting with 
severe disease at the outset. 

Let us start simple. With other common viral illnesses, 
such as flu, we would expect recovery to mean going back
to pre-infection levels of functionality and quality of life. 
This means we must follow up all patients with confirmed 
(by test) or highly probable (by symptoms) COVID and find 
out whether they have returned to their previous normal 
within a specified time from the onset of their symptoms. 

The ‘recovery’ definition must include duration, severity 
and fluctuation of symptoms, as well as functionality and 
quality of life. Everyone who is symptomatic would remain
a ‘case’ until they fulfilled the recovery criteria or died. This
is basic bread-and-butter epidemiology. We just need to 
apply it to this pandemic. 

To do so, we must also define who had the infection 
in the first place. When testing is absent or inaccurate, 
physicians must be provided with universal and simple 
criteria for what constitutes clinical COVID. A good starting
point is the body of studies characterizing typical symptoms
at a population level.

Measuring recovery is not an easy ask with health and 
surveillance systems already struggling to cope. It makes 
sense to set up disease registers, akin to cancer registries, to
track people over time and record their condition. This could 
be done through quick monthly, and subsequently annual, 
check-ups with health-care providers. If national registers 
are not quickly forthcoming, local ones could be started. 

For surveillance, public-health agencies must prior- 
itize agreement on criteria for a definition of recovery, 
and on the structures in which these criteria could be 
implemented. We must overlay research on surveillance 
with studies of the characteristics of those experiencing 
prolonged ill health. We must learn to identify and protect 
the most vulnerable. 

The narrow narrative of death as the only bad outcome 
from COVID needs broadening to include people becoming 
less healthy, less capable, less productive and living with 
more pain. For that, we'll need better surveillance. The 
essential first step is getting clear and universal definitions 
for recovery and COVID severity. 
The world this week


News in brief


Beirut blast among largest 
involving ammonium nitrate 


An explosion in Beirut’s port has killed at least
220 people, injured more than 5,000 and left an 
estimated 300,000 people homeless. 

Lebanese authorities say that the explosion on 
4 August was caused by 2,750 tonnes of ammonium
nitrate, a chemical compound commonly used as
an agricultural fertilizer, which had been stored for 
6 years at a port warehouse. An investigation into what 
triggered the explosion is under way, and early reports 
suggest that it was probably a nearby fire. 

The blast is one of the largest accidental ammonium 
nitrate explosions ever recorded (see ‘Explosive 
chemical’) — so powerful that it was heard more than 
200 kilometres away in Cyprus. 

Ammonium nitrate is cheap and usually safe to 
handle, but storing it can be a problem. Over time, it 
absorbs moisture and forms clumps. When a large 
quantity of compacted ammonium nitrate is exposed 
to intense heat, it can trigger an explosion.

The disaster comes as Lebanon is struggling to cope
with the coronavirus pandemic and an economic 
crisis. Efforts to treat injured people have been 
hampered by damage to hospitals, and the explosion 
destroyed grain silos and much of Beirut’s port. 


EXPLOSIVE CHEMICAL 


The Beirut blast, which killed at least 220 people and injured more than 5,000, is 
one of the largest industrial disasters ever linked to ammonium nitrate (NH₄NO₃).
JAPAN CONSIDERS NEW RULES ON RESEARCH
INTERFERENCE AMID US-CHINA TENSIONS


The Japanese government is 
considering tougher rules
to address the risk of foreign
interference in scientific 
research, such as more- 
thorough vetting of visa 
applications from international 
students and researchers, and 
requiring institutions to declare 
foreign sources of income. 

Last month, Japan’s cabinet 
approved an innovation strategy 
for 2020. This asks government 
agencies, research institutes 
and companies to strengthen 
codes of conduct around 
research integrity and conflicts 
of interest, and to prevent the 
outflow of sensitive research and 
technologies linked to national 
security, such as artificial 
intelligence and semiconductor 
manufacturing. It also proposes 
that government agencies 
consider withholding funding 
from institutions that fail to 
declare foreign income. 

No countries are named by 
the strategy, but researchers say 
the government is concerned 
mainly with the activities of 
Chinese institutions, including 
those with ties to the military. 

The government is now 
considering drawing up 
guidelines on these issues, says 
Takahiro Ueyama, an executive 
member of Japan’s Council
for Science, Technology and
Innovation (CSTI), which is 
chaired by Prime Minister 
Shinzo Abe. “This is a very 
sensitive issue,” Ueyama says. 

The development follows 
crackdowns by US science 
agencies on researchers who 
do not disclose foreign ties, 
mainly with China. In the past 
two months, four ethnic Chinese 
researchers working in the 
United States have been charged 
with visa fraud for failing to 
declare links to China’s military. 

The Japanese government 
feels under pressure to 
strengthen its research-integrity 
guidelines and safeguard its 
scientific relationship with 
the United States, says Atsushi 
Sunami, a science-policy
analyst at the National Graduate 
Institute for Policy Studies in 
Tokyo. “When the US and other 
Western countries started 
talking about these issues, it was 
natural that Japan would also 
address them more clearly.” 

The strategy’s language 
mirrors that of a 2019 report by
the science group JASON, which 
advises the US government. The 
report was commissioned by the 
US National Science Foundation 
over concerns about foreign 
governments acquiring US 
science and technology. 


News in focus 


Controlling deforestation (shown here, in a tropical rainforest in the Congo Basin) could decrease the risk of future pandemics, experts say. 


WHY DEFORESTATION AND 
EXTINCTIONS MAKE 
PANDEMICS MORE LIKELY 


Researchers are redoubling efforts to understand links between biodiversity and 
emerging diseases — and to use that information to predict and stop future outbreaks. 


By Jeff Tollefson 


As humans diminish biodiversity by
cutting down forests and building
more infrastructure, they’re increasing
the risk of pandemics of diseases
such as COVID-19. Many ecologists
have long suspected this, but a study now
helps to reveal why. When some species are
going extinct, those that tend to survive and
thrive — rats and bats, for instance — are more
likely to host potentially dangerous pathogens
that can make the jump to humans.

The analysis of around 6,800 ecological
communities on 6 continents adds to a
growing body of evidence that connects
trends in human development and biodiver- 
sity loss to disease outbreaks — but stops short 
of projecting where new disease outbreaks 
might occur. 

“We’ve been warning about this for decades,” 
says Kate Jones, an ecological modeller at 
University College London and an author of 
the study, published on 5 August in Nature¹.
“Nobody paid any attention.” 

Jones is one of a cadre of researchers that has
long been delving into relationships between 
biodiversity, land use and emerging infectious 
diseases. Their work has mostly flown below 
the radar, but now, as the world reels from the 
COVID-19 pandemic, efforts to map risks in
communities around the globe and to project 
where diseases are most likely to emerge are 
taking centre stage. 

Last week, the Intergovernmental 
Science-Policy Platform on Biodiversity and 
Ecosystem Services (IPBES) hosted an online 
workshop on the nexus between biodiversity 
loss and emerging diseases. The organization’s 
goal now is to produce an expert assessment 
of the science underlying that connection 
ahead of a United Nations summit that’s 
planned for September in New York City, 
where governments are expected to make 
new commitments to preserve biodiversity. 


Others are calling for a more wide-ranging 
course of action. On 24 July, an interdiscipli- 
nary group of scientists, including virologists, 
economists and ecologists, published an essay 
in Science² arguing that governments can help
to reduce the risk of future pandemics by con- 
trolling deforestation and curbing the wildlife 
trade, which involves the sale and consump- 
tion of wild — and often rare — animals that can 
host dangerous pathogens. 

Most efforts to prevent the spread of new 
diseases tend to focus on vaccine develop- 
ment, early diagnosis and containment, but 
that’s like treating the symptoms without 
addressing the underlying cause, says Peter 
Daszak, a zoologist at the non-governmental 
organization EcoHealth Alliance in New York 
City, who chaired the IPBES workshop. He 
says COVID-19 has helped to clarify the need 
to investigate biodiversity’s role in pathogen 
transmission. 

The latest work by Jones’s team bolsters the 
case for action, Daszak says. “We’re looking 
for ways to shift behaviour that would directly 
benefit biodiversity and reduce health risks.” 


Concentrating risk 
Previous research has shown that outbreaks 
of diseases such as severe acute respiratory 
syndrome (SARS) and bird influenza that cross 
over from animals to humans have increased 
in the past few decades³,⁴. This phenomenon
is likely to be the direct result of increased 
contact between humans, wildlife and live- 
stock, as people move into undeveloped 
areas. These interactions happen more fre- 
quently on the frontier of human expansion 
because of changes to the natural landscape 
and increased encounters with animals. 

But a key question over the past decade has
been whether the decline in biodiversity that
inevitably accompanies human expansion on 
the rural frontier increases the pool of patho- 
gens that can make the jump from animals to 
humans. Work by Jones and others⁵ suggests
that the answer in many cases is yes, because 
a loss of biodiversity usually results in a few 
species replacing many — and these species 
tend to be the ones hosting pathogens that 
can spread to humans. 

For their latest analysis, Jones and her 
team compiled more than 3.2 million records 
from several hundred ecological studies at 
sites around the world, ranging from native 
forests to cropland to cities. They found 
that the populations of species known to 
host diseases transmissible to humans —
including 143 mammals such as bats, rodents
and various primates — increased as the land- 
scape changed from natural to urban, and as 
biodiversity generally decreased. 

Some researchers urge caution when 
communicating that biodiversity hotspots 
are where outbreaks are likely to occur. “My 
worry, frankly, is that people are going to cut 
down the forests more if this is where they think 
the next pandemic is going to come from,” says
Dan Nepstad, a tropical ecologist and founder 
of the Earth Innovation Institute based in San 
Francisco, California, a non-profit organization
that campaigns for sustainable development. 
Efforts to preserve biodiversity will work, he
says, only if they address the economic and
cultural factors that drive deforestation
and the dependency of rural poor people on
hunting and trading wild animals.

Wildlife markets such as this one in Bali, Indonesia, sustain the livelihoods of many people.

Ibrahima Socé Fall, an epidemiologist and 
head of emergency operations at the World 
Health Organization in Geneva, Switzerland, 
agrees that understanding the ecology — as 
well as the social and economic trends — of the
rural frontier will be essential to projecting the 
risk of future disease outbreaks. “Sustainable 
development is crucial,” he says. “If we 
continue to have this level of deforestation, 
disorganized mining and unplanned develop- 
ment, we are going to have more outbreaks.” 


Coordinating efforts 


One message that the upcoming IPBES 
report is likely to deliver is that scientists and 
policymakers need to treat the rural frontier 
more holistically, addressing issues of pub- 
lic health, the environment and sustainable 
development in tandem. In the wake of the 
COVID-19 pandemic, many scientists and con- 
servationists have emphasized curbing the 
wildlife trade — an industry worth an estimated 
US$20 billion annually in China, where the first 
coronavirus infections appeared. China has 
temporarily suspended its trade. But Daszak 
says the industry is just one piece of a larger 
puzzle that involves hunting, livestock, land 
use and ecology. 

“Ecologists should be working with 
infectious-disease researchers, public-health 
workers and medics to track environmental 
change, assess the risk of pathogens crossing 
over and reduce risky human activities,” he 
says. 

Daszak was an author of last month’s essay in 
Science, which argued that governments could 
substantially reduce the risk of future pandem- 
ics such as that of COVID-19 by investing in 
efforts to curb deforestation and the wildlife 
trade, as well as in efforts to monitor, prevent 
and control new virus outbreaks from wildlife 
and livestock. The team estimated that the cost 
of these actions would ring in at $22 billion 
to $33 billion annually. The total investment
would be two orders of magnitude less than
the $5.6-trillion price tag estimated for the
COVID-19 pandemic.

Fall says the key is to align efforts by govern- 
ment and international agencies focused on 
public health, animal health, the environment 
and sustainable development. 

With the right collaboration between 
human-health, animal-health and environ- 
mental authorities, Fall says, “you have some 
mechanisms for early warnings”. 


1. Gibb, R. et al. Nature https://doi.org/10.1038/s41586-020-2562-8 (2020).
2. Dobson, A. P. et al. Science 369, 379-381 (2020).
3. Jones, K. E. et al. Nature 451, 990-993 (2008).
4. Smith, K. F. et al. J. R. Soc. Interface 11, 20140950 (2014).
5. Faust, C. L. et al. Ecol. Lett. 21, 471-483 (2018).




Artificial intelligence is one of the areas that the US National Science Foundation will prioritize. 


NSF'S COMMITMENT
TO BASIC RESEARCH
QUESTIONED


The US National Science Foundation shifts the focus 
of its graduate fellowships to computer science. 


By Giuliana Viglione 


The major US agency tasked with
funding basic research, the National
Science Foundation (NSF), raised
alarm among scientists late last month
when it updated the guidance for its
prestigious graduate-student fellowships to 
emphasize research in three areas of applied 
computational science. 

Critics fear that the new focus on artificial 
intelligence, computationally intensive 
research and quantum information science 
— despite the 70-year-old agency’s historical 
mandate to promote and support all basic 
scientific research — will decrease money 
for fundamental science that can struggle 
to attract funding from other government 
or industry sources. They also fear that 
the changes could make it even harder for 
graduate students from under-represented 
groups — including white women and Black 
and Latinx scientists — to win these grants. 

The change to the guidance for the Grad- 
uate Research Fellowship Program (GRFP) 
— which hands out hundreds of millions 
of dollars of research funding each year — 
comes amid concern in the United States
about the growth of China as a research
superpower. 

It is just one of several recent efforts to 
steer the NSF towards applied-technology 
research that have concerned researchers. In 
late May, bills introduced to both houses of 
Congress with bipartisan support proposed 
to increase the NSF’s budget by US$100 billion
over 5 years. The bills, which have not yet been 
voted on, would rebrand the organization as 
the National Science and Technology Foun- 
dation, and would allocate the new money to 
technology development, rather than to basic 
science. 


Fears for basic science 


“The value of basic science is not always 
apparent to non-scientists,” says Margaret 
Byron, a mechanical engineer at Pennsylvania 
State University in State College, who received 
the fellowship in 2012. “If there’s pressure from
those outside of science to push towards direc- 
tions that have more immediate applications, 
then we need to push back.” 

The NSF said in both a public statement 
and an e-mail to Nature that the move is part 
of “a coordinated federal strategy to secure 
America’s position as a global leader in 
research and innovation”, but that the fellowship
“will continue to encourage and accept
applications in all eligible fields of science and 
engineering”. 

The agency awards around 2,000 graduate 
fellowships each year, and it requested just 
over $275 million for the programme in 2021. 
The fellowships, which fund master’s or PhD 
students for 3 years, are typically awarded in 
11 major fields of study, including engineer- 
ing, life sciences and chemistry, but had pre- 
viously not given preference to any particular 
subfields within these broad categories. 

The new GRFP guidance says applications 
are “encouraged” in all disciplines supported 
by the NSF that incorporate the three new 
high-priority research areas. 

The focus on three types of computer-based 
science is “kind of bonkers to me”, says Michael 
Hoffman, who received the fellowship in 
2003 and is now a computational biologist 
at the University of Toronto and the Princess 
Margaret Cancer Centre in Toronto, Canada. 
“These are focus areas that are already, right 
now, very well funded.” The strength of the 
GRFP, he says, is that it trains scientists across 
a broad range of disciplines that are not typi- 
cally funded by other agencies. That’s impor- 
tant because “you can never predict which 
areas are going to have the really important 
discoveries”, Hoffman says. 


Twitter backlash 


The changes prompted a backlash almost 
immediately, with scores of scientists 
expressing their discontent on Twitter. 
“These changes are incredibly restrictive and 
will almost certainly hurt bringing in bright 
and diverse students in the sciences broadly,”
wrote Alexandra Harmon-Threatt, a pollina- 
tion ecologist at the University of Illinois at 
Urbana-Champaign. Amy Tarangelo, a cancer 
biologist at the University of Texas Southwestern
Medical Center in Dallas, tweeted: “As a
former fellow, I’m super disappointed in this 
decision from @NSFGRFP. GRFP should fund 
promising scientists in ALL fields without 
regard to the ‘hot topic’ of the moment. This 
will hurt students at unis without resources 
for high-level computing, etc.” 

Some welcome the move towards computer 
science, however. “I think the changes are 
important and reflect the country’s need for 
more talents and advances in the fields,” says 
Anh Nguyen, a machine-learning researcher 
at Auburn University in Alabama. These fields 
“have a strong potential to transform human 
lives” across a variety of disciplines, he says. 

Still, the concentration of funding in certain 
fields without an expansion of the programme 
means that other areas — such as basic science
— will be cut back, says Kelsey Lucas, a marine
and aquatic comparative biomechanist at the 
University of Michigan in Ann Arbor, whose 
graduate work was supported by the GRFP. “By 
focusing on certain areas, that means other
areas are going to be getting less funding.” 

In an e-mail to Nature, a spokesperson 
for the NSF wrote, “These changes are not 
intended to exclude any areas of science sup- 
ported by NSF,” and pointed to the advances in 
basic science that the NSF has funded over the 
past seven decades. “NSF is simply signifying 
that these are areas of national importance 
and we are encouraging students to apply.” 
The spokesperson also said that the areas 
of emphasis would not change the review or 
selection process. 


Diversity at risk 


Nevertheless, scientists fear that the move 
could make it harder for certain people to win 
these prestigious grants — including Black and 
Latinx scientists. 

“Frankly, I was disappointed,” says
Christian Cazares, a neuroscientist at the
University of California, San Diego, and a
current GRFP award recipient. He views the 
changes as “completely antithetical” to the 
NSF’s stated commitment to promoting 
diversity in science. The lack of diversity in 
the three new priority fields — only 18.8% 
of US computer-science bachelor’s degrees 
went to Black and Latinx students and 18.7% 
to women in 2016 — means that the move will 
perpetuate already-existing disparities, he 
says. (According to a 2014 review of the pro- 
gramme, between 1994 and 2004, about 80% 
of GRFP awards went to white applicants.) 

Research has shown funding priorities to be 
one of the main drivers of inequity in grants 
awarded to Black scientists, says Alexandra 
Clark, a neuropsychologist at the University 
of California, San Diego. In 2019, a team at the
US National Institutes of Health found that 
topic choice had the second-largest effect 
on the gap in award rates between white and 
Black researchers among proposals selected 
for discussion by reviewers (T. A. Hoppe et al.
Sci. Adv. 5, eaaw7238; 2019). “When we know 
there’s an opportunity gap that’s at play,” 
Clark says, “we can’t really just continue on 
like business as usual.” 

“Increasing diversity and inclusiveness 
is a top priority for the Director and NSF,” 
wrote a spokesperson for the NSF in an 
e-mail to Nature, adding that the agency’s 
director has established a task force to make 
recommendations for addressing barriers to 
inclusion. 

Byron is hopeful that the changes will 
not significantly alter the GRFP selection 
process. But she does worry that students 
from under-represented backgrounds or 
lower-resourced schools will be dissuaded 
from applying because of the programme’s 
new emphasis. “The last thing anybody wants 
to see is for a really stellar young scientist to 
look at this new solicitation and say, ‘that’s 
not me.’”
There are growing concerns about ties between China’s military and its universities. 


AUSTRALIA'S PLAN TO END 
FOREIGN INTERFERENCE 
IN SCIENCE: DID IT WORK? 


Pioneering guidelines aren’t enough to prevent 
overseas militaries co-opting research, say experts. 


By Dyani Lewis 


Almost a year after Australia introduced 
a pioneering system for minimizing 
the risk of foreign interference 
in research — in particular, from 
overseas militaries — observers are 
divided about whether it is working. 

The guidelines, which were introduced 
last November and are widely assumed to be 
a response to concerns about the Chinese 
military’s ties to universities, encourage 
institutions to perform risk assessments on 
potential collaborators, communicate the 
risk of foreign interference to staff and bolster 
cybersecurity. They also urge universities to 
ensure that they comply with laws that restrict 
exports of certain technologies, such as those 
that have military uses. 

Although other countries, including the 
United States, the United Kingdom and 
Japan, are grappling with similar concerns, 
Australia is the first to set such a specific set 
of guidelines for its universities. 

But some specialists warn that Australia’s 
guidelines and export laws aren’t sufficient 
to help universities identify collaborations 
in which research could lead to military 
applications. Although the guidelines 
outline ‘best practice’ steps that universities 
could take to mitigate risks, the measures are 
optional, says Alex Joske, a China analyst at the 
Australian Strategic Policy Institute, a think 
tank in Canberra. 

Others question whether universities are 
up to the task of scrutinizing international 
partners, particularly those in China. The 
guidelines are “basically saying, do due 
diligence on your partner in an opaque 
authoritarian political system”, says Jeffrey 
Wilson, a political scientist at the think tank 
Perth USAsia Centre in Crawley, Australia. 
“You're asking people to do something no one 
can do, except maybe a spy agency,” he says. 

Others say the system is working well. 
James Laurenceson, director of the Australia- 
China Relations Institute at the University of 
Technology Sydney, says export controls on 
weapons and technologies that could have 
military uses, such as facial-recognition or 
cybersecurity software, reduce the risk of 
research being used by international armed 
forces. He says this is the right approach — 
focusing on the research, rather than who the 
collaboration is with. “Fundamental questions 
about the science being conducted — I think 
they’re more important ones for us to be 
asking,” he says. 


Research partners 


On paper, China is Australia’s biggest research 
partner. In 2019, Australian researchers 
co-authored close to 14,000 papers with 
authors who had affiliations in China, 
according to an analysis of papers indexed in 
the Scopus database (go.nature.com/3adyful). 
That’s 16.2% of Australia’s research output, 
more than for any other international partner 
nation (see ‘Australia’s top collaborators’). 

Fierce competition for limited government 
funds in Australia is driving collaborations, 
particularly with China, says Wilson, adding 
that this exacerbates the risk of interference. 
“One of the challenges for Australian 
universities is that they have been heavily 
incentivized to seek foreign research income,” 
he says. 

In most cases, research collaboration 
benefits all parties, says Yun Jiang, a 
geoeconomist at the Australian National 
University in Canberra. Problematic research 
is only a small part of research collaborations, 
she says. 

But as China increases links between its 
civilian universities and the armed forces — a 
policy known as military-civil fusion — those 
links are becoming more difficult to identify, 
says Joske. Chinese universities, he says, “are 
more and more integrated with military assets, 
especially around research and development. 
They’re working on classified projects and a lot 
of their graduates are going into the military 
or the defence industry.” 


Military ties 

In 2019, Joske and his colleagues developed 
the China Defence Universities Tracker, which 
places Chinese institutions on a risk scale 
according to how closely they are associated 
with the military and whether they have been 
accused of or engaged in espionage and 
intellectual-property theft. 

Ninety-two institutions — including 60 run 
by the People’s Liberation Army or security 
and intelligence agencies, and 20 civilian 
universities — are rated as very high risk. A 
further 23 civilian universities are graded as 
high risk. 

Joske is concerned about collaborations 
with universities in China that have military 
ties — and about how these should be policed. 
In his view, the Australian government 
should provide universities with information 
about which foreign institutions deserve 
more scrutiny, and should establish a 
national research-integrity office that can 
enforce existing national-security and 
export-control laws. 

Some academics think that collaborations 
with military institutions should be ruled out 
entirely. “These universities are exceptionally 
important to the development of China’s 
military technology,” says Clive Hamilton, 
a public-policy researcher at Charles Sturt 
University in Canberra, who has investigated 
Chinese influence at Australian universities. 
“So why take the risk?” 

The tracker also lists 12 state-owned 
defence-industry conglomerates, 4 of which 
have links to overseas universities. 

One example of the kinds of relationship 
that are raising alarm is outlined by Joske 
in a 2019 report accompanying the tracker 
(see go.nature.com/3gjynsf). The report 
noted that one of those conglomerates, 
the Commercial Aircraft Corporation of 
China (COMAC), signed a memorandum of 
understanding with Monash University in 
Melbourne in May 2017 to design 3D-printed 
aircraft components. The collaboration will 
also fund an Aus$10-million (US$7-million) 
Aeronautical Research Centre that will be 
established at the university, a spokesperson 
from the university said. 

Joske says the collaboration is a cause 
for concern because, last year, the 
cybersecurity company CrowdStrike in 
Sunnyvale, California, accused COMAC 
of using technology stolen through 
cyberespionage from rival airlines to design 
its C919 commercial aircraft, which could be 
converted into a military surveillance aircraft. 


AUSTRALIA'S TOP COLLABORATORS 


Last year, researchers in Australia co-authored 
almost 14,000 papers with people at Chinese 
institutions — more than with any other country. 
COMAC did not respond to Nature’s questions 
about the allegations. 

But the university defended the partnership, 
saying it followed the Australian Code for the 
Responsible Conduct of Research and the 
university’s own policies and procedures. 
“The university regularly monitors advice 
provided by the government, intelligence 
agencies and the wider education sector, and 
acts in accordance with advice received,” the 
spokesperson said. 


Group of Eight 

Several Australian institutions have made 
changes in response to the guidelines 
produced last year. Nature contacted the 
‘Group of Eight’ leading research universities, 
and of the seven that responded, five said 
they had introduced new processes or were 
deciding whether to do so. 

But some experts think that more needs 
to be done. The guidelines recommend that 
foreign affiliations and funding be recorded. 
Hamilton says universities should go a step 
further, and make these registers public. 
“Transparency must be priority number one,” 
he says. “Australian universities are public 
institutions, and I can see no reason why the 
public should not be permitted to know the 
affiliations and financial links academics 
may have with other organizations, at home 
or abroad,” he says. 

And he says the universities should take 
a hard line with researchers who fail to 
disclose foreign ties, including firing them 
in some cases. The government should also 
be stricter with universities, says Hamilton: 
“Universities that fail to heed government 
rules and guidelines concerning research links 
that jeopardize national security should be 
excluded from receiving funding from the 
Australian Research Council.” 


Limits to transparency 


In the United States, funding agencies have 
alerted institutions to grant recipients 
with potential undisclosed foreign ties. 
Investigations have led to dozens of 
researchers being sacked or forced to give 
back research funding. 

But transparency has its limits, says Wilson. 
“We want to be transparent. But how do you 
deal with that when your research partner 
is fundamentally not?” he asks. Although 
information on connections to the military is 
openly available in some instances, he says, it 
could easily be hidden. 

In June, the Australian government 
announced that it had created a new integrity 
unit to identify and analyse emerging threats 
to the quality of higher education, and to 
assist universities in addressing foreign- 
interference threats. Joske says it’s a positive 
move, but it will take time to prove that it is 
working. 
News in focus 


Pioneering research director: 
‘We're in a cataclysmic time of change’ 


Rita Colwell is the former director of the 
US National Science Foundation (NSF) — 
the first woman to hold that post — and a 
leader in cholera research. In her new book, 
A Lab of One's Own: One Woman’s Personal 
Journey Through Sexism in Science (written 
with Sharon Bertsch McGrayne), she opens 
up on one topic she hasn't said much 
about publicly: her battle to improve the 
situation for women in science. Colwell, a 
microbiologist at the University of Maryland 
College Park, spoke to Nature about 
discrimination, and how her experiences as 
a researcher and agency leader during the 
2001 anthrax bioterror attacks can inform 
the response to the coronavirus pandemic. 


You've been a scientist since the 1950s. 
What's changed, and what hasn't, regarding 
sexism in science? 

When I was an undergraduate, I went to the 
department chair to ask for a fellowship to 
allow me to pursue a master’s degree. He 
told me, quite bluntly, that they didn’t waste 
fellowships on women. I don’t think any 
chair, dean or faculty member would say that 
today. It would probably be nuanced, to the 
effect of, “We don’t have any available.” That 
makes it less obvious, and I’m not so sure 
less hurtful — because it still deflects the 
career path. 
Protests have erupted in the United States over racism and COVID-19 shutdowns. 


What do you want early-career 
female scientists to take away from 
your book? 

That you’re not alone. Your experiences that 
you endure are shared. Like other women, I 
assumed the problems I have had were some 
sort of problem of mine. But in fact, it’s really 
the system. 


What should institutions do to improve 
the situation for women? 

Agencies, particularly funding agencies, 
can have a tremendous effect. The NSF has 
really been a leader, tracing back to when 
Mary Clutter, who was an assistant director, 
forcibly mandated [in 1989] that committees 
and scientific meetings that received funding 
had to have female representatives on the 
committee and female speakers at the 
meeting. 


Racism in science is also deeply 
entrenched. Why is it important to 
change that? 

There’s tremendous talent distributed 
throughout the population, and the country 
needs all of it. It’s really important to draw 
from 100% of the population — and not 50% or 
less. The challenges we're facing right now are 
huge. We're in one of the cataclysmic times of 
change. 
In 2001, you helped to hunt down the 
source of the US anthrax attacks. What 
lessons from that could help scientists 
investigating COVID-19? 
The anthrax episode was a fantastic 
experience showing patriotic cooperation 
among agencies and among branches of 
Congress. Because I was director of the NSF, 
I was able to have a leadership role, and 
because I was a microbiologist, I was poised 
to recognize immediately that we had to 
sequence the bacterium that had been sent 
out as a bioweapon. The group met every 
week for an hour in a classified meeting. We 
had maybe 16 or 17 agencies represented. 
The lesson was collaboration and 
cooperation. No single agency could solve 
the problem. There was shared trust and 
shared commitment, and no politics in any 
of our decisions. If we had been able to 
have an inter-agency, intergovernmental 
interaction straight away in January or 
February, I think we would not have 150,000 
COVID-19 deaths — and climbing — in the 
United States. 


How has your laboratory changed its 
emphasis in response to COVID-19? 

I have been working on cholera for my 
entire career. It is an aquatic bacterium 
distributed in the environment. Twenty 
years ago, we developed the use of 
satellite sensing to monitor environmental 
parameters that allowed us to track 
and predict cholera outbreaks. We're 
modifying that model for COVID. It’s very 
interdisciplinary. That's the power of 
understanding diversity — human, as well as 
environmental — and protecting both. 


What role should international scientific 
collaboration have in fighting the pandemic? 
Diseases don’t carry passports, and they 
don’t behave like tourists clearing the 
border. It’s almost anti-human not to 
understand that we are a global society, and 
that the benefit of science — for example, 
vaccines — needs to be shared. And it needs 
to be shared empathetically in a way that 
makes it affordable for the poorest citizens of 
the world. 


Interview by Alexandra Witze 
This interview has been edited for length and 
clarity. 


WHAT ANIMALS 
REALLY THINK 


Neuroscientists are scrutinizing mountains of 
data to uncover how brains create emotions 
and other internal states, such as aggression 
or desire. By Alison Abbott 


Two years ago, Jennifer Li and 
Drew Robson were trawling 
through terabytes of data from a 
zebrafish-brain experiment when 
they came across a handful of cells 
that seemed to be psychic. 
The two neuroscientists had 
planned to map brain activity while 
zebrafish larvae were hunting for food, and 
to see how the neural chatter changed. It 
was their first major test of a technological 
platform they had built at Harvard Univer- 
sity in Cambridge, Massachusetts. The plat- 
form allowed them to view every cell in the 
larvae’s brains while the creatures — barely 
the size of an eyelash — swam freely in a 
35-millimetre-diameter dish of water, snack- 
ing on their microscopic prey. 

Out of the scientists’ mountain of data 
emerged a handful of neurons that predicted 
when a larva was next going to catch and 
swallow a morsel. Some of these neurons even 
became activated many seconds before the 
larva fixed its eyes on the prey¹. 


Something else was strange. Looking in more 
detail at the data, the researchers realized that 
the ‘psychic’ cells were active for an unusually 
long time — not seconds, as is typical for most 
neurons, but many minutes. In fact, more or 
less the duration of the larvae’s hunting bouts. 

“It was spooky,” says Li. “None of it made 
sense.” 

Li and Robson turned to the literature and 
slowly realized that the cells must be setting 
an overall ‘brain state’ — a pattern of prolonged 
brain activity that primed the larvae to engage 
with the food in front of them. The pair learnt 
that, in the past few years, other scientists 
using various approaches and different spe- 
cies had also found internal brain states that 
alter how an animal behaves, even when noth- 
ing has changed in its external environment. 

Some, such as Li and Robson, had come to 
the discovery serendipitously while trudging 
through their own brain-wide data. Others have 
hypothesized that neurons coding for inter- 
nal brain states must exist, and have actively 
sought them in discrete and well-researched 
brain regions. For example, earlier this 
year², neurobiologist David Anderson at the 
California Institute of Technology (Caltech) 
in Pasadena and his colleagues identified an 
internal brain state — represented by a small 
network of neurons — that prepares fruit flies 
to engage in courtship or fighting behaviours. 

Neuroscientists wanting to understand the 
brain’s coding language have conventionally 
studied how its networks of cells respond to 
sensory information and how they generate 
behaviour, such as movement or speech. But 
they couldn't look in detail at the important bit 
in between — the vast quantities of neuronal 
activity that conceal patterns representing 
the animal’s mood or desires, and which help 
it to calibrate its behaviour. Even just a few 
years ago, measuring the activities of specific 
networks that underlie internal brain states 
was impossible. 

A slew of new techniques is starting to 
change that. These methods allow scientists 
to track electrical activity in the brain in 
unprecedented detail, to quantify an animal’s 
natural behaviour on millisecond timescales, 
and to find patterns in the mountains of data 
these experiments generate. These patterns 
could be signatures of the innumerable inter- 
nal states that a brain can adopt. Now the 
challenge is to find out what these states mean. 

Some neuroscientists are daring to wield 
the technologies to probe one powerful group 
of internal brain states: emotions. Others are 
applying them to states such as motivation, or 
existential drives such as thirst. Researchers 
are even finding signatures of states in their 
data for which they have no vocabulary. 

The current trickle of research papers on 
internal brain states is gaining momentum. 
The work might even have potential clinical 
applications. “Mental illness is essentially 
disruption of internal states,” says Joshua 
Gordon, director of the US National Institute 
of Mental Health in Bethesda, Maryland. “They 
need to be understood.” 


Frames of mind 


The brain of any animal is constantly 
bombarded with information about the 
creature’s environment from sensory organs 
such as the eyes, ears, nose or skin. All of this 
information is initially processed in the brain’s 
sensory cortex. Then come more mysterious 
processing steps, in which that information 
is filtered through multiple internal brain 
states representing the creature’s constantly 
changing moods and needs. That finally leads 
the motor cortex to generate movements that 
are appropriate to the circumstances — to flap 
away a tickling fly, for example, or to move 
towards a tasty treat. Internal states can also be 
generated entirely in the brain, without sensory 
input and without a behavioural output: think 
of daydreaming, or replaying the events of the 
day in your mind. 


Nature | Vol 584 | 13 August 2020 | 183 




Over the past few years, insights into the 
nature of internal states are changing how 
neuroscientists who study brain networks think 
about animal behaviour. “We used to think of 
animals as being kind of stimulus-response 
machines,” says neuroscientist Anne 
Churchland at Cold Spring Harbor Labora- 
tory, New York. “Now we're starting to realize 
that all kinds of really interesting stuff is being 
generated within their brains that changes the 
way that sensory inputs are processed — and 
so changes the animals’ behavioural output.” 

Sketching out how to study this intriguing 
middle ground has long been a preoccupation 
for Anderson. Six years ago, he decided to 
create a theoretical framework for research into 
internal brain states that represent emotion. He 
was irked by the view of some psychologists, 
who think that because animals can’t express 
their feelings in words, those feelings can’t 
be studied at all. Together with his Caltech 
colleague Ralph Adolphs, Anderson developed 
and published a hypothesis³ about the 
characteristics that neural circuits associated 
with internal brain states should have. 

Most importantly, they thought, an internal 
brain state should outlast the original stimulus 
that triggered it. So a key feature of a neural 
circuit underpinning such a state would be 
its persistence, he says. “If you are hiking in 
mountains and see a snake, then you might 
jump in fear,” says Anderson. “Ten minutes 
later, your brain’s internal state of fear is still 
active, so when you see a stick on your path 
you might jump again.” 
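
The persistence criterion can be illustrated with a minimal sketch. It assumes a hypothetical activity trace sampled at fixed intervals and an arbitrary activity threshold; the function name, the traces and the numbers are invented for illustration and are not taken from any of the studies described here. The idea is simply to measure how long a cell stays active once the triggering stimulus has ended.

```python
def persistence_after_stimulus(trace, first_post_stimulus_index, threshold, dt):
    """Return how long (in seconds) activity stays at or above `threshold`
    once the stimulus is over. All arguments are illustrative assumptions:
    `trace` is a list of activity values, `dt` is the sampling interval."""
    duration = 0.0
    for value in trace[first_post_stimulus_index:]:
        if value < threshold:
            break  # activity has decayed; stop counting
        duration += dt
    return duration


# Hypothetical traces sampled once per second; the stimulus spans indices 1-2.
fast_cell = [0.0, 1.0, 1.0, 0.1, 0.0, 0.0, 0.0, 0.0]  # switches off with the stimulus
slow_cell = [0.0, 1.0, 1.0, 0.9, 0.8, 0.7, 0.6, 0.2]  # outlasts the stimulus

fast_persistence = persistence_after_stimulus(fast_cell, 3, 0.5, 1.0)
slow_persistence = persistence_after_stimulus(slow_cell, 3, 0.5, 1.0)
```

In this toy case only the second trace would count as persistent. Real recordings are noisy, so an actual analysis would need smoothing and a principled threshold rather than the fixed cut-off assumed here.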

Other characteristics of internal states 
should include generalizability, meaning that 
different stimuli should be able to prompt the 
same state, and scalability, in which different 
stimuli can create states of different strength. 
The paper became influential. Li says that it 
“was inspirational” as she and Robson were 
trying to make sense of their psychic cells. 

Anderson and Adolphs published their 
paper in 2014, just as a raft of neurotechnol- 
ogies was starting to make the necessary 
experiments feasible. It was already possible 
to record from large numbers of individual 
neurons at the same time, and since then the 
technologies have improved and expanded 
remarkably, allowing scientists to analyse 
previously inaccessible activity. 

Leading the pack is the Neuropixels probe, 
just 10 mm long, which can directly record 
activity in hundreds of neurons across 
different brain areas⁴. And special imaging 
techniques can indicate where as many as 
tens of thousands of individual neurons are 
active across the brain. In calcium imaging, for 
instance, animals are genetically engineered 
to express a molecule in their cells that detects 
calcium ions — when these pour into a neuron 
as it fires, the molecule fluoresces. 

New automatic behaviour monitors take 
video recordings of freely behaving animals 
over many hours, and analyse every movement 
in millisecond elements. The elements can 
then be aligned with neural recordings, 
matching moment-to-moment brain activity 
with specific movements. 

Distinct groups of neurons control whether zebrafish larvae explore (right) or stay put (left). 

Neuroscientists have capitalized on a surge 
in machine learning, artificial intelligence and 
new mathematical tools to make sense of the 
gigabytes or terabytes of data that any exper- 
iment with these technologies can generate, 
and to coax out the neural activation patterns 
that could represent internal brain states. 


Ready to engage 

For his first study of an internal state, Anderson 
decided to build on his laboratory’s previous 
interest in aggression in the fruit fly, which has 
a tiny brain containing about 100,000 neu- 
rons. In many animal species, males start to 
fight each other in the presence of females — 
a well-established behaviour that Anderson 
calls the ‘Helen of Troy effect’, after the Greek 
myth about a woman whose competing suitors 
started a war. Fruit flies are no exception: 
indirect evidence suggests that exposure to 
females causes males to engage in both court- 
ship songs and aggressive behaviour towards 
other males for many minutes. “That’s a long 
time in the short life of a fruit fly,” he says. 

He decided to search for neural activity 
that correlated with the persistent courtship 
and fighting behaviours that are initiated by 
neurons known as P1, found in a region that 
controls such social behaviours. These neu- 
rons fire so quickly that they alone couldn't 
be responsible for maintaining an internal 
state. Using imaging techniques along with 
automated behavioural analysis, his group 
identified cells in other brain areas that become 


© 2020 Springer Nature Limited. All rights reserved. 


active as a consequence of P1 activation. 

Most of these ‘follower cells’ switched quickly 
on and off, but a cluster called pCd neurons 
stayed active for many minutes. When the 
researchers inserted a light-sensitive protein 
into these cells and switched them off using 
a flash from a laser, the persistent effect of P1 
activation on behaviour disappeared. When 
they activated them directly, bypassing P1, 
nothing happened: the pCd neurons needed 
P1 as a trigger and, once sparked into action, 
they stayed on for much longer than the initial 
prompt². If Anderson had to give the state a 
name, he might call it the ‘ready-to-engage-in- 
these-social-behaviours’ state, he says. 

His team has conducted a similar experiment 
in mice⁵, which have more complex 
brains containing about 100 million neurons. 
The researchers found a particular group of 
neurons in the hypothalamus that, just like the 
pCd neurons, became persistently activated 
in association with an innate drive — this time, 
fear. When the scientists placed a rat close to 
experimental mice for just a few seconds, 
the mice responded defensively by hugging 
the wall for several minutes and the group 
of neurons remained active for all this time. 
When the team again used light to switch the 
neurons on and off, the wall-hugging behav- 
iour came and went in tandem, even with no 
rat present. 

Neuroscientists are now discovering other 
groups of neurons with persistent activity in 
different brain areas. Using calcium imaging in 
mice, Andreas Lüthi at the Friedrich Miescher 
Institute for Biomedical Research in Basel, Switzerland, 
and Jan Gründemann at the University 
of Basel searched in the amygdala, which is central 
to the regulation of a range of emotions 
and behaviours. The team found two different 
populations of neurons that displayed sus- 
tained but opposing activation when the mice 
switched between two distinct behaviours⁶ — 
exploring the environment and performing 
defensive behaviours such as freezing. 

Gründemann acknowledges that the 
amygdala cells are unlikely to be working in 
isolation, and that cells across the whole brain 
are involved in maintaining the explorative or 
defensive states. “I’m sure it is just one node in 
larger, brain-wide networks,” he says. 


The whole picture 


Whereas many researchers have searched 
particular brain areas for neurons that have 
enduring activity, Li and Robson, who moved 
to Germany last September to jointly run a 
lab at the Max Planck Institute for Biological 
Cybernetics in Tübingen, came across their 
persistently active neurons almost by chance. 

Their zebrafish larvae are less complex than 
fruit flies, having only 80,000 or so brain cells. 
Because these baby fish are transparent, the 
activity of nearly all of their neurons can be 
monitored simultaneously using calcium 
imaging. 

The pair has developed a method of 
concurrently following both the movements 
and the neural activity as fish larvae swim 
freely around a dish. They deploy a fluores- 
cent-microscope tracking system that moves 
on its imaging platform to keep the fish in 
constant view, and captures every flash of 
each neuron as the larvae move. The system 
also films them — typically for 90 minutes, 
generating 4.5 terabytes of data — allowing 
the experimenters to align movement with 
neuronal activity second by second. 
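
To give a sense of scale, the figures quoted above imply a sustained data rate that can be worked out directly. The sketch below assumes decimal units (1 terabyte = 10^12 bytes) and a constant rate over the whole session; the function name is hypothetical.

```python
def mean_data_rate_gb_per_s(terabytes, minutes):
    """Average data rate in gigabytes per second, assuming decimal units
    (1 TB = 1e12 bytes, 1 GB = 1e9 bytes) and a constant recording rate."""
    total_bytes = terabytes * 1e12
    seconds = minutes * 60
    return total_bytes / seconds / 1e9


# 4.5 terabytes over a 90-minute session, as quoted in the text.
rate = mean_data_rate_gb_per_s(4.5, 90)
```

Under these assumptions the rate comes to a little over 0.8 gigabytes per second, sustained for an hour and a half.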

Fish larvae might not seem to have the rich 
internal life enjoyed by mice, or even flies, 
but they have at least one robust behavioural 
choice to make in their lives — whether to 
hunt locally, or to swim to unfamiliar waters 
to search for new food sources. When Li and 
Robson watched larvae making this choice, 
they found three groups of neurons: one that 
was persistently active during local hunting, 
another that stayed active during exploration 
and a third that flashed on briefly as the fish 
switched states¹. Surprisingly, hunger didn’t 
seem to influence the states, which switched 
automatically every few minutes — “just like our 
own sleep-wake states switch automatically, 
but on a much shorter timescale”, Robson says. 

Neuroscientists working with more com- 
plex organisms can’t monitor the whole brain 
at once, but they have been able to find hints 
of internal brain states with networks that are 
widely distributed in the brain. In technically 
challenging experiments in mice, they have 
recorded the activity of thousands of neurons 
throughout the brain using calcium imaging, 
and of hundreds of neurons using a single 
Neuropixels electrode, several of which can 
be inserted at once. 

In a study published last year⁷, neuroscientist 
Karl Deisseroth at Stanford University in Cali- 
fornia and his team used Neuropixels probes 
to record the activity of 24,000 neurons across 
34 cortical and subcortical brain regions in 
thirsty mice that were licking water from a 
spout. The scientists were able to tease out 
signals related to the brain state of thirst 
from signals related to licking behaviour. They 
found that these state-signalling neurons were 
activated throughout the brain — not just in 
the hypothalamus, where dedicated thirst 
neurons are located. 

Using these extensive recording techniques, 
neuroscientists are finding that there is a lot 
going on beneath the surface when an animal 
performs a task — and not all of it seems rele- 
vant at first glance. In landmark papers last 
year, groups led by Kenneth Harris at Uni- 
versity College London and by Churchland 
showed that when a mouse is engaged in a 
task, neurons activate throughout the brain, 
but that a large proportion of the activation 
is not correlated with the task at all⁸,⁹. Some 
activity correlated instead with the animals’ 
fidgety movements. But around two-thirds 
of the off-task activation didn’t tally with any 
movement or action. “Part of this may be 
related to internal brain states,” says Harris. 


Busy brain 


Many neuroscientists say that the sheer volume 
of data pouring out of whole-brain experi- 
ments is also the field’s biggest bottleneck. 
But they have been making progress in devel- 
oping techniques to sift through the flood of 
measurements. One popular approach is to 
use a mathematical method called the hidden 
Markov model (HMM) to predict the probabil- 
ity that a system will switch between different 
states at a particular time. 
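
As a rough illustration of the idea, the sketch below runs the standard forward (filtering) step of a discrete HMM in plain Python. Everything specific in it is an assumption made for the example: the two hidden states, the two kinds of observed movement and all of the probabilities are invented, and it is not the model used in any study mentioned here.

```python
def forward_filter(observations, start, trans, emit):
    """Return, for each time step, the probability of each hidden state
    given the observations so far (the normalized HMM forward recursion).

    start[s]     : probability of beginning in state s
    trans[i][j]  : probability of switching from state i to state j
    emit[s][o]   : probability that state s produces observation o
    """
    n_states = len(start)
    # Initial belief: prior weighted by how well each state explains the first observation.
    belief = [start[s] * emit[s][observations[0]] for s in range(n_states)]
    total = sum(belief)
    belief = [b / total for b in belief]
    history = [belief]
    for obs in observations[1:]:
        # Predict: let the state switch according to the transition matrix.
        predicted = [
            sum(belief[i] * trans[i][j] for i in range(n_states))
            for j in range(n_states)
        ]
        # Update: reweight by the likelihood of the new observation, then normalize.
        updated = [predicted[j] * emit[j][obs] for j in range(n_states)]
        total = sum(updated)
        belief = [u / total for u in updated]
        history.append(belief)
    return history


# Hypothetical two-state example: state 0 = "stay local", state 1 = "explore".
# Observation 0 = short movement, observation 1 = long movement.
start = [0.5, 0.5]
trans = [[0.9, 0.1], [0.1, 0.9]]   # states tend to persist
emit = [[0.8, 0.2], [0.2, 0.8]]    # each state favours one movement type
beliefs = forward_filter([0, 0, 1, 1, 1], start, trans, emit)
```

With these made-up numbers the inferred state starts out favouring "stay local" and shifts towards "explore" only after several long movements in a row, because the transition matrix makes states persistent. In practice the transition and emission probabilities are not known in advance and have to be fitted to the recordings.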

Mala Murthy at Princeton University, New 
Jersey, and her colleagues used the HMM to 
discover rhythms in the brains of male fruit 
flies¹⁰ that influenced their choice of song 
pattern when courting females. Whether 
male flies choose on a moment-to-moment 
basis to sing in staccato pulses or longer hums 
depends in large part — but not totally — on 
how the females respond to them. Murthy’s 
group found that three different internal brain 
states also affected the male’s song choice. 
They dubbed the fly dispositions Close, Chas- 
ing and Whatever. 

No matter the complexity of the model 
organism that individual researchers have 
adopted — worm, fish, fly or mouse — the 
question of how the entire brain coordinates 
internal states “is what we are all starting to 
think about”, says Steve Flavell at the Massachu- 
setts Institute of Technology in Cambridge. In 
2013, Flavell and his colleagues discovered that 
even the brain of the Caenorhabditis elegans 
worm, which has only 302 neurons, displays 
properties of internal brain states that drive 
particular behaviours, including two sets 
of persistently active neurons controlling 
whether the animal lingers locally or moves
with purpose. His group has since identified
the full circuitry involved in the two states and
switching between them.

Aside from their questions about the basic 
biology, researchers have an eye on the clini- 
cal benefit of understanding how a particular 
state manifests in the brain. Those studying 
pain in rodent models, for instance, rely on 
standard tests such as observing when a rat 
lifts its paw from a hot plate. “That movement 
reflects protective aspects of pain, but not the 
actual perception of pain,” says neurologist 
Clifford Woolf at the Boston Children’s Hos- 
pital in Massachusetts. That makes it a poor 
model for pain, he argues, because it is one 
step removed from the actual sensation. He 
has launched a research programme to try to
directly read brain signals that indicate the
internal state of pain perception, potentially a
more timely and specific readout than waiting 
for the animal’s response. “I’m extremely opti- 
mistic that we’re in one of those rare stages in 
science where this is going to be a transforma- 
tion of the way we do things,” he says. 

In this new field, even the basics are up for 
grabs, says Li. “At this stage, we are still trying 
to understand what the questions are.” 


Alison Abbott is a writer based in Munich, 
Germany.


Crunch, rip, freeze or
decay: how will it all end?


Astrophysicist Katie Mack’s book explores all the ways 
the Universe could destroy itself. By Ramin Skibba 


Scientists know how the world will
end. The Sun will run out of fuel and
enter its red-giant phase. Its final
burst of glory will expand and engulf
the closest planets, leaving Earth a
charred, lifeless rock. Our planet has around
five billion years left.

With this grim image, theoretical 
astrophysicist Katie Mack begins her book 
on the end of the Universe — a much more 
uncertain prospect. Cosmologists generally 
look backwards, because all the evidence 
they can examine with telescopes is far away 
and concerns things that happened long ago. 
Using the motions of distant stars and galax- 
ies to predict possible futures involves more 
speculation. 

In Mack’s hands, this speculation makes for 
a fascinating story. Humans are, she writes, “a 
species poised between an awareness of our 
ultimate insignificance and an ability to reach 
far beyond our mundane lives, into the void, to 
solve the most fundamental mysteries of the 
cosmos”. She is a talented communicator of 
complex physics, and the passion and curiosity 
about astronomy that have made her a popular 
speaker and Twitter presence are evident here. 
(As are some nerdy jokes and a less compelling
coda about new physics research tangential to 
the central theme.) 

Mack begins at the beginning, with the Big 
Bang. What followed was inflation — a period 
of rapid expansion. Then, structures of dark 
matter formed and the building blocks of stars, 
planets, life and galaxies assembled. Currently, 
dark energy, thought to pervade the Universe,
somehow counteracts the forces of gravity to
keep driving expansion.

The End of Everything (Astrophysically Speaking)
Katie Mack
Scribner (2020)

The Universe is expanding, for now.

The Universe’s fate depends on whether 
that expansion will continue, accelerate or 
reverse. 

Astrophysicists long considered the most 
likely denouement to be a reversal of the Big 
Bang — the Big Crunch. Outside our cosmic 
neighbourhood, every galaxy is zooming 
away from us, a clear sign of expansion. If the
Universe holds enough matter, including dark 
matter, the combined gravitational attraction 
of everything will gradually halt this expansion 
and precipitate the ultimate collapse. Over 
time, galaxies, then individual stars, will smash
into each other more frequently, killing off any
life on nearby planets. In the final moments, as
densities and temperatures soar in a contracting
inferno, all that remains will extinguish in
a single point.

But dark energy might mean that a different 
end awaits. The early years of the Universe’s 
evolution were determined by the amount of 
matter it held; over the past few billion years, 
dark energy has begun to dominate, pushing 
the Universe outwards. Current data from the
European Space Agency’s Planck telescope 
and other sources are consistent with this 
expansion continuing forever. 

Called the Heat Death or Big Freeze, this 
apocalypse will be “slow and agonizing”, Mack 
writes. In thermodynamic terms, she explains,
the Universe will approach a state of minimum
temperature and maximum entropy. As 
everything gets farther and farther apart, the 
material of dead stars will disperse so that 
new stars can’t form, and the galaxies they’re 
part of will gradually stop growing. It’s like a 
suffocation of all astrophysical activity, as the 
fuel for growth and reproduction becomes so 
diffuse as to be unusable. It is an end “marked 
by increasing isolation, inexorable decay, and 
an eons-long fade into darkness”. 

The third demise that Mack discusses is the 
Big Rip. This is in store if dark energy accel- 
erates expansion even more than is currently 
expected. As the Universe balloons, even- 
tually, gravitational forces won't be able to 
keep galactic clusters together. Stars will be 
stripped away from each other, and solar sys- 
tems such as ours won't have the strength to 
stay together. The remaining stars and plan- 
ets will explode. Finally, the last atoms will be 
ripped apart. 

The latest measurements point to a Heat 
Death, but a Big Crunch or Big Rip are within 
their uncertainties. 

The final doomsday scenario that Mack 
describes is extremely unlikely: vacuum decay. 
A tiny bubble of ‘true vacuum’ could form, 
owing to instability in the field associated with 
the Higgs boson. That might happen if, say, a 
black hole evaporates in just the wrong way. 
Such a bubble would expand at the speed of 
light, destroying everything, until it cancels 
the Universe. Vacuum decay might already
have begun in some distant place. We won't 
see it coming. 

Not to worry, though. As Mack counsels, 
whatever it looks like, the end probably won't 
be nigh for at least 200 billion years. 


Ramin Skibba is an astrophysicist turned
science writer based in San Diego, California.
e-mail: raminskibba@gmail.com


Readers respond


Correspondence 


COVID-19: time to 
re-imagine academic 
publishing 


High-profile retractions 
have highlighted how the 
conventional model of academic 
publishing has struggled to keep 
pace with the race to understand 
the new coronavirus, SARS- 
CoV-2. The system is ripe for 
innovation. To that end, an 
open-access overlay journal 
known as Rapid Reviews: 
COVID-19 (RR:C19; see go.nature. 
com/3fufauw) uses the speed of 
technology to democratize the 
review process and strengthen 
the quality of research. 

RR:C19 was launched this 
year by the MIT Press and 
the University of California, 
Berkeley, with support from the 
Patrick J. McGovern Foundation. 
Scientists, publishers and 
philanthropic foundations 
work together to swiftly deploy 
new models for digitally 
enabled publishing. The journal 
promotes rapid and transparent 
peer review of promising or 
controversial preprints, as well 
as dynamic curation of content 
(see B. M. Stern and E. K. O’Shea 
PLoS Biol. 17, e3000116; 2019). 

Philanthropic foundations 
have been leaders in funding 
risky scientific ventures. In 
our experience, extending 
that support to advance the 
publishing process will boost 
the quality of research and 
accelerate its dissemination. 


Vilas Dhar* Patrick J. McGovern 
Foundation, Boston, 
Massachusetts, USA. 


Amy Brand* The MIT Press, 
Cambridge, Massachusetts, USA. 
amybrand@mit.edu 

*V.D. and A.B. declare competing
interests; see go.nature. 
com/2xajnzt 


COVID-19: indexed 
data speed up 
solutions 


Since the COVID-19 pandemic 
began, cross-disciplinary data 
sets for the virus have been 
proliferating daily. However, 
these can be difficult for 
researchers to find, link to and 
reuse — for example, if they 
want to explore new hypotheses 
or to test existing ones. To 
this end, we have developed 
the COVID-19 Data Index 
(www.covid19dataindex.org). 

Originally funded by the US 
National Institutes of Health 
as part of the Big Data to 
Knowledge project, the index 
hosts a metadata catalogue of 
COVID-19 data sets. 

These range from, 
for example, clinical, 
sociodemographic, 
environmental, economic and 
mobility data to case statistics 
and genomic sequences. 
These data are collected from 
large repositories, research 
papers and individual online 
sources, among others. Users 
can filter search results by type 
(‘-omics’ data versus clinical 
data, for instance), repository 
or geographic location. The 
index then supplies links to the 
original data and the download 
page. 

We update the COVID-19 
Data Index daily. Finding data 
on COVID-19 is no longer an 
obstacle that could delay 
discoveries. 


Lucila Ohno-Machado* University
of California, San Diego, La Jolla, 
California, USA. 
lohnomachado@health.ucsd.edu 


Hua Xu The University of Texas 
Health Science Center at 
Houston, Texas, USA. 

*L.O.-M. declares competing 
interests; see go.nature. 
com/3gfcgv


Denmark recycling
plan will cut waste by 
two-thirds 


As one of the European Union’s 
largest energy consumers
and greenhouse-gas polluters
(go.nature.com/33piuuv),
Denmark will launch the EU’s
most-ambitious recycling plan
in July next year. It aims to cut
the country’s annual amount 
of waste for incineration from 
800 to 250 kilograms per 
capita, reducing carbon dioxide 
emissions to 0.7 million tonnes 
by 2030. Citizens will sort their 
waste into ten different types. 

The move is in part a response
to the COVID-19 pandemic
and to a new EU directive for
environmental sustainability 
that promotes a circular 
economy, lower emissions 
and a reduction in the use of 
raw materials and hazardous 
substances (J. B. Zimmerman 
et al. Science 367, 397-400;
2020). It is hoped that the plan 
will limit ecosystem damage 
and the health effects of toxic 
industrial chemicals. It will 
also discourage Denmark’s 
unacceptable export of waste to 
low-income countries. 

If other countries were to 
adopt similar practices, the 
world would align faster with 
the United Nations Sustainable 
Development Goals on 
sustainability and planetary 
health. 


Su Shiung Lam Universiti Malaysia 
Terengganu. 


Aage K. O. Alstrup Aarhus 
University, Aarhus, Denmark. 


COVID-19: full peer 
review in hours 


The impetus to rapidly 
disseminate scientific results 
during a crisis ahead of peer 
review could cause governments 
and international organizations 
to act prematurely — or to be 
reluctant to act at all. Having 
struggled with such challenges 
in our COVID-19 work, we 
recommend our tested review 
system, which has an ultrashort 
submission-to-acceptance time. 

One of us (W.S.) runs a
workshop every September 
to identify horizon-scanning 
issues in conservation, which 
we aim to report on in Trends 
in Ecology and Evolution the 
following January. The journal’s 
editor (formerly K.A.L.) agrees 
a submission date and selects 
referees. Authors send in a
working draft a week before 
formal submission so that 
referees have time to prepare 
their comments. In the first year
(2009), the time from formal 
submission to return of detailed 
comments was 90 minutes. 

The crucial features of this 
process are advance selection 
of referees and the provision of 
a draft manuscript. Agreeing a 
submission date makes planning 
easier for referees but is not 
essential. 

Our model could be used for 
ultrafast peer review of COVID- 
19 papers (see M. A. Johansson 
and D. Saderi Nature 579, 29; 
2020). Setting up a pool of 
referees dedicated to rapid 
review of key papers would 
help relieve pressure on 
overstretched individuals. 


Christian Sonne Aarhus 
University, Roskilde, Denmark. 
cs@bios.au.dk


Protein degraders
extend their reach 


Claire Whitworth & Alessio Ciulli 


Molecules have previously been made that induce protein 
destruction inside cells. A new class of molecule now induces 
the degradation of membrane and extracellular proteins — 
opening up avenues for drug discovery. See p.291 


Most drugs act by binding to a specific site in 
a target protein to block or modulate the pro- 
tein’s function. The activity of many proteins, 
however, cannot be altered in this way. An 
emerging class of drug instead brings proteins 
into proximity with other molecules, which 
then alter protein function in unconventional
ways. One such approach uses drug
molecules called protein degraders, which 
promote the tagging of proteins with ubiq- 
uitin, another small protein. Tagged proteins 
are then broken down into small peptide mol- 
ecules by the cell’s proteasome machinery. 
But because the ubiquitin-mediated degra- 
dation pathway occurs inside the cell, protein 
degraders developed so far attack mainly 
intracellular targets. On page 291, Banik et al.
now report a different mechanism that opens
up extracellular and membrane-bound pro- 
teins for targeted degradation. 

The authors report protein degraders 
that they call lysosome-targeting chimaeras 
(LYTACs), which are bifunctional (they have 
two binding regions; Fig. 1). One end carries an 
oligoglycopeptide group that binds to a
transmembrane receptor (the cation-independent
mannose-6-phosphate receptor; CI-M6PR) at
the cell surface. The other end carries either
an antibody or a small molecule that binds to
the protein targeted for destruction. These 
two regions are joined by a chemical linker. 

The formation of a trimeric CI-M6PR- 
LYTAC-target complex at the plasma 
membrane directs the complex for 
destruction by protease enzymes in mem- 
brane-enclosed organelles called lysosomes. 
LYTACs are conceptually related, but complementary,
to proteolysis-targeting chimaeras
(PROTACs) — another bifunctional class of 
protein degrader that mainly targets intracellular
proteins by recruiting them to E3
ligases (the enzymes that tag proteins with
ubiquitin).

Banik et al. began by making LYTACs of 
varying size and linker composition, and 
which used a small molecule called biotin 
as the protein-binding component — bio- 
tin binds with exceptionally high affinity to 
avidin proteins. The authors observed that 
these LYTACs rapidly shuttled an extracellu- 
lar fluorescent avidin protein to intracellular 
lysosomes in a way that required engagement
with CI-M6PR. When the authors replaced
biotin with an antibody that recognizes 
apolipoprotein E4 (a protein implicated in 
neurodegenerative diseases), this protein was 
also internalized and degraded by lysosomes. 
LYTACs can, therefore, repurpose antibodies 
from their normal immune function to 
direct extracellular proteins for lysosomal 
degradation. 

Next, Banik et al. investigated whether 
LYTACs could induce the degradation of 
membrane proteins that are targets for drug 
discovery. In several cancer cell lines, LYTACs 
did indeed induce the internalization and lyso- 
somal degradation of the epidermal growth 
factor receptor (EGFR), a membrane protein
that drives cell proliferation by activating a 
signalling pathway. Depletion of EGFR levels 
by LYTACs in the cancer cell lines reduced 
signal activation downstream of EGFR, com- 
pared with the amount observed when EGFRs 
were blocked by antibodies alone. This result 
confirms a previously reported advantage
of using target degradation in therapeutic 
applications, rather than target blocking. 

Similar outcomes were observed with
LYTACs for other single-pass transmembrane
proteins (proteins that span the cell membrane
only once), including programmed
death ligand 1 (PD-L1), which helps cancer cells
to evade the immune system. The next step
will be to establish whether LYTACs can also
induce the degradation of multi-pass proteins
that span the membrane several times, such as
the ubiquitous G-protein-coupled receptors
and proteins that transport materials across
membranes (ion channels and solute-carrier
proteins, for example). If so, it will be interesting
to compare the performance of LYTACs,
which would bind to the extracellular domains
of such proteins, with that of PROTACs, which
can bind to the intracellular domains of these
proteins (as was recently demonstrated for
solute-carrier proteins).

Figure 1 | Mechanism of action of lysosome-targeting chimaeras (LYTACs). Banik et al. report
LYTAC molecules, which consist of an oligoglycopeptide group (which binds to a cell-surface receptor,
CI-M6PR) and an antibody that binds to a specific transmembrane or extracellular protein. The antibody
can also be replaced by a small protein-binding molecule (not shown). On simultaneously binding to both
CI-M6PR and the target protein, the resulting complex is engulfed by the cell membrane, which forms a
transport vesicle. This carries the complex to a lysosome (an organelle that contains protein-degrading
enzymes). The protein is degraded and the receptor is recycled; it remains to be seen whether the LYTAC is
also degraded. LYTACs are potentially useful for therapeutic applications.

As with any new drug modality, there is 
scope for improvement. For example, Banik 
and colleagues’ first PD-L1-targeting LYTACs 
produced only partial degradation of the 
protein, which the authors attributed to 
low expression of CI-M6PR in the cell lines 
used. When the authors made a second type 
of LYTAC that incorporated a more potent 
PD-L1 antibody, degradation increased, 
albeit in cells that expressed greater levels 
of CI-M6PR than did the original cell lines. 
This shows that low abundance of the lys- 
osome-shuttling receptor hijacked by the 
LYTAC (in this case, CI-M6PR) can reduce the 
effectiveness of these degraders. Similarly, 
the loss of core components of E3 ligases is a 
common mechanism by which cells become 
resistant to PROTACs. Lysosome-shuttling
receptors other than CI-M6PR could be used 
by LYTACs as alternatives, should resistance 
emerge. Degraders that target cell-type-spe- 
cific receptors might also have improved 
safety profiles compared with conventional 
small-molecule therapeutics, which are not 
always cell-type selective. 

What sets PROTACs and LYTACs apart from 
conventional drugs is their mode of action. 
For example, after a PROTAC has brought 
about the destruction of a target protein, the 
PROTAC is released and can induce further 
cycles of ubiquitin tagging and degradation,
thereby acting as a catalyst at low concentrations.
Mechanistic studies are now
warranted to determine whether LYTACs also 
work catalytically. 

Another aspect of the mode of action of 
both PROTACs and LYTACs is that they bring
two proteins together, to form a trimeric complex.
A general feature of such processes is
the hook effect, whereby trimer formation, 
and thereby the associated biological activity, 
decreases at high drug concentrations. This 
is because dimeric complexes generally form 
preferentially at high drug concentrations — 
an undesirable effect that can be alleviated by 
ensuring that all three components interact 
in such a way that trimer formation is more 
favourable than is dimer formation.
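
The hook effect can be sketched with a simple equilibrium model. The code below assumes non-cooperative binding and that both proteins are dilute relative to the drug, so that free drug is approximately equal to total drug. Under those assumptions the amount of trimer is proportional to L / ((Kd1 + L)(Kd2 + L)), where L is the drug concentration. The dissociation constants used are hypothetical and do not describe any real LYTAC or PROTAC.

```python
import math


def relative_trimer(drug, kd_receptor, kd_target):
    """Relative trimer level (arbitrary units) for a bifunctional drug.

    Assumes non-cooperative binding and dilute proteins (an idealization).
    """
    return drug / ((kd_receptor + drug) * (kd_target + drug))


def peak_drug_concentration(kd_receptor, kd_target):
    """Drug concentration that maximizes trimer level in this simple model."""
    return math.sqrt(kd_receptor * kd_target)


# Hypothetical dissociation constants, in nanomolar.
KD_RECEPTOR = 10.0
KD_TARGET = 100.0

peak = peak_drug_concentration(KD_RECEPTOR, KD_TARGET)
for drug in (peak / 100, peak, peak * 100):
    print(f"{drug:10.2f} nM -> {relative_trimer(drug, KD_RECEPTOR, KD_TARGET):.5f}")
```

In this model, the trimer level falls off on both sides of the peak: too little drug leaves the proteins unbound, whereas too much drug saturates each protein separately with its own drug molecule. Positive cooperativity, which this sketch leaves out, would favour the trimer over the dimers.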

Kinetics also matters for protein degraders. 
For example, stable and long-lived trimeric
complexes that involve PROTACs accelerate
target degradation, improving drug potency
and selectivity. It will be crucial to understand
how the complexes formed by LYTACs can be 
optimized to improve degradation activity. 

PROTACs and LYTACs are larger molecules
than conventional drugs. As a result of their 
size, PROTACs often do not permeate well 
through biological membranes, which can 
make them less potent drugs than the biolog- 
ically active groups they contain. Size should 
be less of a problem for LYTACs because they 
do not need to cross the cell membrane, 
although they would still need to pass through 
biological barriers to combat diseases of the 
central nervous system. The development of 
lysosomal degraders that are smaller and less 
polar than LYTACs — and therefore more able 
to pass through membranes — will be eagerly 
anticipated. Small ‘glue’ molecules that bind 
to E3 ligases can already do the same job as 
PROTACs.

Targeted protein degradation is a promising 
therapeutic strategy, and the first PROTACs
are currently in clinical trials. LYTACs will
need to play catch-up, but they have earned 
their place as a tool poised to expand the 
range of proteins that can be degraded. 
Their development as therapies will require 
an understanding of their behaviour in the 
human body: their pharmacokinetics, toxicity,
and how they are metabolized, distributed
and excreted, for example. It can be challenging
to optimize the biological behaviour of
molecules that incorporate large groups, such
as antibodies and oligoglycopeptides, during 
drug discovery, but this problem can be over- 
come by further engineering the structures 
of these groups. Banik and colleagues’ new
approach to degradation therefore warrants
an all-hands-on-deck approach.

Scientists working in drug discovery will 
eagerly await the development of LYTACs 
and the emergence of other methods for the 
drug-induced degradation of proteins. Is no
protein beyond the reach of degraders? 


Claire Whitworth and Alessio Ciulli are in 
the Division of Biological Chemistry and 
Drug Discovery, School of Life Sciences, 
James Black Centre, University of Dundee, 
Dundee DD1 5EH, UK. 

e-mail: a.ciulli@dundee.ac.uk


Rethinking extinctions
that arise from habitat loss 


Joaquin Hortal & Ana M. C. Santos 


Does the loss of species through habitat decline follow the 
same pattern whether the area lost is part of a large or a small
habitat? An analysis sheds light on this long-running debate, 
with its implications for conservation strategies. See p.238 


Understanding how habitat size affects the 
abundance of all the species living in a community
provides ecological insights and is
valuable for developing strategies to boost 
biodiversity. On page 238, Chase et al. report
results that might help to settle a long-running 
debate about the relationship between the 
area of a habitat and the diversity of species 
it can host. 

Land transformation by human activity is a 
major component of global change. The loss 
of natural habitats reduces the local diversity
and abundance of species, and has been
implicated in more than one-third of animal
extinctions worldwide between 1600 and 1992 
(ref. 3). A report from the Intergovernmental 
Science-Policy Platform on Biodiversity and 
Ecosystem Services estimates that currently 
more than half a million species (about 9% of
all terrestrial species) might lack the amount
of habitat needed for their long-term survival.
Moreover, their disappearance would
compromise many key ecosystem services, 
such as pollination or the control of pests or 
disease-causing agents. 

The effect of habitat loss on biodiversity has
been conventionally estimated on the basis
of the relationship between area and species
richness, which was first described more than
150 years ago. This seemingly universal relationship
is simple: the larger a given habitat’s
area, the more species it holds, although the
number of species increases with area in a
nonlinear way. There is a limit to the number
of individuals of ecologically similar species
that can persist in an area, owing to the limited
resources that it harbours. When a habitat
loses part of its area, therefore, for many
species, it also loses its capacity to support
populations that are large enough to be viable.
These species become extinct as habitat area
diminishes with land-use intensification.
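
One widely used way to write the nonlinear species–area relationship is the power law S = c·A^z. The text above does not specify a functional form, so the sketch below is an assumption, and the values of c and z are illustrative rather than fitted to any data set.

```python
def species_count(area, c=10.0, z=0.25):
    """Power-law species-area curve S = c * A**z (c and z are illustrative)."""
    return c * area ** z


def fraction_of_species_retained(area_fraction_left, z=0.25):
    """Fraction of species predicted to persist when only part of the area remains."""
    return area_fraction_left ** z


for left in (0.9, 0.5, 0.1):
    kept = fraction_of_species_retained(left)
    print(f"{left:.0%} of area left -> {kept:.0%} of species retained")
```

With the assumed exponent of 0.25, losing 90% of the area leaves roughly 56% of the species. This is the kind of estimate that the classical approach produces from area alone.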

Chase and colleagues propose an elegant 
and simple approach to account for the 
dynamics of communities occupying habitat 
patches of different size. Rather than consider- 
ing only the overall number of species in each 
habitat fragment, the authors focused on the 
number and relative abundance of different 
species in samples obtained from such fragments.
This allows the structure of ecological
communities to be compared directly, while
avoiding problems that can arise when taking
into account the differences in the effort
needed to sample large and small areas. The
authors’ approach also allows a comparison
of variations in the relative abundance of individuals
of all species, a measure of community
structure that is associated with ecosystem
dynamics.

Thanks to this method, Chase et al. could 
distinguish between three patterns of change 
that might occur as an outcome of habitat loss 
(Fig. 1). In the pattern described by the ‘passive
sampling’ model, the structure of the com- 
munity remains the same in large and small 
fragments. Therefore, each sample provides 
similar species richness (the number of spe- 
cies), abundance (the number of individuals) 
and evenness (the allocation of individuals 
to the different species), regardless of the 
total habitat size. In this case, species decline 
will mirror the loss of habitat area under the 
classical species–area theory, and the total
number of species in the entire fragment 
would depend solely on its size. 
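
The three per-sample quantities named above can be computed from a list of counts per species. In the sketch below, evenness is measured with Pielou's index (Shannon entropy divided by its maximum). That choice is an assumption, because the text does not say which evenness metric the authors used, and the sample counts are invented.

```python
import math


def sample_metrics(counts):
    """Return (richness, abundance, evenness) for one sample.

    counts: number of individuals recorded for each species.
    Evenness is Pielou's J (an assumed metric); it is None when fewer
    than two species are present, because J is undefined in that case.
    """
    present = [c for c in counts if c > 0]
    richness = len(present)
    abundance = sum(present)
    if richness < 2:
        return richness, abundance, None
    shannon = -sum((c / abundance) * math.log(c / abundance) for c in present)
    return richness, abundance, shannon / math.log(richness)


# Hypothetical samples: an even community and one dominated by a single species.
even_sample = [10, 10, 10, 10]
dominated_sample = [37, 1, 1, 1]

print(sample_metrics(even_sample))
print(sample_metrics(dominated_sample))
```

Both samples have the same richness and abundance, but the second has much lower evenness. A count of species alone would miss that difference.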

The other two patterns are described as 
types of ecosystem decay — a hypothesis pro- 
posing that a habitat that shrinks undergoes 
a disproportionately high loss of organisms 
compared with the loss of habitat area. One 
type of ecosystem decay is proposed to occur 
owing to excessive loss of individuals. Smaller 
habitat fragments will contain fewer individ- 
uals per sample than will larger ones, and all 
species are equally affected. This generates 
communities with fewer species in smaller 
fragments, but no changes in the relative 
abundance of species per sample between 
small and large fragments. 

The other type of ecosystem decay occurs
owing to uneven changes in relative species
abundances coupled to species loss. In this
scenario, the species present have different
responses to habitat loss, and therefore species
become relatively more or less abundant
in smaller fragments than in larger fragments.
Their relative abundance becomes more
uneven in samples from smaller fragments
as some species increase their numerical
dominance, impoverishing the community
and causing it to become species poor.

[Figure 1 panels: a, Large ecosystem; b, Small ecosystem (passive sampling; ecosystem decay (individuals); ecosystem decay (evenness)). Axes: species against abundance per sample.]

Figure 1 | Assessing how habitat size affects ecosystem dynamics. Understanding the relationship
between a decline in habitat area and the effect on species is crucial for designing conservation strategies.
a,b, Chase et al. analysed studies that sampled species in particular habitats. The authors compared the
diversity of organisms, such as insects, in samples obtained from large ecosystems (a) with samples taken
from the same sampling area in smaller fragments of the same type of habitat (b). These graphs show
hypothetical results for species abundance per sample, and different species are shown in different colours.
This method enabled the authors to distinguish between three possible outcomes as habitats become
smaller. In the passive-sampling model, species are equally distributed in habitat fragments of any size, so
the richness, abundance and relative species prevalence (evenness) per sample is constant, regardless of the
total habitat size. In the ecosystem-decay (individuals) model, samples from smaller fragments have fewer
individuals and species per sample than do samples from larger fragments, and all species abundances
decline in a similar way as habitat is lost. In the ecosystem-decay (evenness) model, species vary in their
response to habitat loss, and there is a change in their relative abundances. Chase et al. find that ecosystem
decay, usually following the evenness model, is the best match for the observed data.

Using data from around 120 human- 
transformed landscapes worldwide, Chase 
et al. show that, in general, samples from 
small fragments of natural habitat have fewer 
individuals, fewer species and a more uneven 
abundance of species than samples taken 
from larger fragments do. This outcome is 
consistent with a generalized pattern of eco- 
system decay, mainly as a result of a decline 
in evenness (see Fig. 1), and this result holds, 
regardless of the type of habitat or organ- 
ism studied. This implies that the alteration 
of natural habitats causes major functional 
changes in ecosystem dynamics that go 
beyond simply losing populations and species. 
Therefore, current estimates of extinctions 
associated with habitat loss made on the 
basis of the passive-sampling model might 
be underestimating not only the number of
species that are threatened or already gone,
but also the consequences of their loss for 
ecological functioning and the provision of 
ecosystem services. 

Changes in biodiversity after habitat loss 
alter many ecological processes, eventually
causing catastrophic effects that accelerate
the extinction process. But local extinctions
are often not immediate. Some species persist
with reduced abundances and declining population
dynamics (known as ‘extinction debt’)
that last until the final individuals perish.
This causes an uneven distribution of species 
abundance that is vividly demonstrated by 
Chase and colleagues’ method. Their analysis 
reveals a few ‘winning’ species that dominate 
the community in small habitats, and a very 
large number of rare species, many of which 
are probably heading towards extinction. 

Declining species can be replaced by others 
coming from the neighbouring human-altered 
landscape, particularly in habitat edges, producing
what are described as ‘edge effects’
that are comparatively more important 
in smaller fragments. Indeed, in the early 
stages of land transformation, communities 
in small fragments are more different from 
pristine communities than are those in large 
fragments, with communities in small frag- 
ments becoming more similar to those in large 


Nature | Vol 584 | 13 August 2020 | 195 


News & views 


fragments over time, as they recover from the 
effect of land transformation. According to
Chase and colleagues, the degree of decay 
in diversity and species abundance found 
between large and small fragments is smaller 
in the older or ‘softly’ transformed European 
landscapes than in the more recently and 
dramatically transformed North American 
ones. This indicates that, over time, species 
moving in from the edges of the human-altered
habitats might compensate, at least in
part, for the ecological functions carried out
by native species in larger habitats, causing
small fragments to reach a new, yet different,
ecological balance.

Although this work underscores the key 
role of habitat area in maintaining ecosystem 
processes, there is little exploration of how 
these processes are altered by habitat loss. 
Species from higher trophic levels (the upper 
levels of the food chain), such as predators, 
require larger areas to maintain their popu- 
lations compared with species from lower 
trophic levels, so the number of individuals 
supported by smaller habitat fragments might 
not suffice to maintain populations of top 
predators or consumers, and hence would 
produce shorter food chains and alter the 
ecosystem structure. Differences in extinction
rates between trophic levels can cause
striking changes in ecosystem functioning at
habitat edges, jeopardizing ecosystem functioning
and ecosystem-service provision as natural
habitats diminish in size.

Chase and colleagues’ results call for a 
reconsideration of the debate over whether 
a single large area devoted to conservation 
would preserve more species than would 
several small ones that combine to make 
up the same total size. Some current evidence
suggests that one continuous habitat
might host fewer species than do many small
patches that total the same area. However,
the large ecological changes that these 
small fragments might undergo could end 
up resulting in massive reductions in eco- 
system function and, ultimately, increased 
extinction rates of native species over the 
long term compared with the case for a sin- 
gle, large protected area. 

Chase and colleagues’ approach is good for 
providing a general overview of the extent of 
these effects, but to understand exactly how 
ecological processes are changing locally, a 
higher level of detail will be needed. This 
will require going beyond the studies of 
trophic chains to assess more-complex
food webs, and to gather information on
changes in species’ functional responses and 
trait diversity in increasingly smaller habitats. 
Ultimately, this information will reveal which 
ecological processes are decaying, and what 
the consequences of such ecosystem decay 
are for the maintenance of fully functional 
biodiversity.

Stretch exercises for
stem cells expand the skin 


Matthias Rübsam & Carien M. Niessen


Stretching the skin of mice reveals that mechanical strain
is communicated by a subpopulation of stem cells that
proliferate and promote mechanical resistance, and so
generate extra skin. See p.268


The cells of our bodies are exposed to a range 
of mechanical forces — including compres- 
sion, shear and stretching — that they must 
resist to maintain tissue integrity and func- 
tion. For example, skin responds to stretch- 
ing forces by expanding. Physicians have 
exploited this particular response for more 
than 60 years, implanting stretching devices
in the skin to cause tissue expansion for plastic
surgery or to repair birth defects. But exactly
how mechanical strain creates extra tissue in
a living organism has not been known. On
page 268, Aragona et al. now provide compelling
insights (at the molecular, single-cell
and cell-population level) into how stem cells 
in the skin of mice sense and communicate 
stretch to make new tissue. 

The surface of the skin — a multi-layered 
tissue called the epidermis — protects 
organisms against dehydration and environ- 
mental stresses, including mechanical 
challenges. To ensure lifelong protection, the 
epidermis is constantly renewed through the 
generation of new stem cells in its basal layer. 
This renewal is balanced with differentiation 
and the movement of stem cells to generate the
upper, barrier-forming layers of the epidermis.
Ultimately, the barrier-forming cells are shed
from the surface, to be replaced by new cells.

Aragona et al. set out to examine how the 
epidermis responds to strain. The group 
positioned a device used in human surgeries
(a self-inflating gel) under the skin of
mice. They then examined indicators of force
perception, including changes in cell shape,
the structure of a mechanosensitive protein
called α-catenin, and a network of keratin
proteins that provides cells with mechanical 
resilience. This analysis revealed that epider- 
mal stem cells do indeed sense and respond 
to strain. The authors observed a temporary 
increase in stem-cell division, followed by 
thickening of the epidermis. Thus, increased 
stem-cell renewal fuels stem-cell differentia- 
tion. The two effects combine to maintain a 
functional barrier at the same time as extra 
skin is generated. 

The researchers next genetically 
engineered cells in the basal epidermal layer 
such that the stem cells and their descend- 
ants were fluorescently marked. Tracking of 
these cell lineages over time confirmed that 
stretching tips the renewal-differentiation 
balance in favour of making more stem cells. 
This explains why the epidermis expands in 
response to stretching. 


https://doi.org/10.1038/d41586-020-02158-y

Aragona et al. demonstrated that force 
changes stem cells at the molecular level in 
several ways. First, stretching increased the 
expression of genes involved in cell-cell 
adhesion, which have been shown to communicate
force in vitro. Second, expression of
components of the actomyosin cytoskeleton (a
network of protein filaments that generates
contractile forces in cells) was increased.
Third, stretching promoted signalling
through the EGF–MAP kinase–ERK pathway
(a cascade of proteins that promotes growth).
The researchers also assessed changes in chro- 
matin, the DNA-protein complex that parcels 
up the genome in cells; such changes can 
lead to altered gene expression. This analysis 
revealed that stretch induced the expression 
of a network of regulatory genes that links 
stem-cell proliferation to skin maintenance. 
The authors then examined how strain alters
gene-expression profiles of single epidermal 
stem cells, by sequencing the cells’ RNA. This 
revealed that only a subpopulation of stem 
cells undergoes the molecular changes asso- 
ciated with a stretched state. Why might this 
be? Perhaps those that take on the stretched 
state experience greater force. Alternatively, 
maybe stem cells exist in varying biochemical 
states, and thus are more or less sensitive to
force. Or local differences in stem-cell shape
and mechanics could determine how each cell
responds to stretch. Answering this question
will require measurements of cellular forces
and stiffness in vivo, which is still a major
challenge. In addition, it remains unclear whether
the stretched stem cells alone are driven to
proliferate, or whether these cells then
induce expansion of surrounding stem cells.

Figure 1 | How skin stem cells respond to stretching. a, The surface of skin is a multi-layered tissue called
the epidermis, which has stem cells in its basal layer. Like all cells, the stem cells have a contractile network
of protein filaments called the actomyosin cytoskeleton, and express the transcription factors YAP, TAZ
and MAL. b, Aragona et al. placed an expanding gel under the skin of mice. They report that, in a subset of
epidermal stem cells, the actomyosin cytoskeleton is reorganized. This somehow triggers movement of YAP,
TAZ and MAL to the nucleus. The proteins induce gene-expression changes that promote an increase in both
stem-cell proliferation and differentiation into cells that move into the upper layers of the epidermis. This
dual response leads to expansion of skin tissue without compromising the barrier function of the epidermis.

Aragona et al. next genetically engineered
mice to lack Diaph3 and Myh9, genes involved 
in regulation of the actomyosin cytoskeleton. 
Without these genes, stem-cell responses to 
stretch were absent, leading to a barrier defect 
in the animals. The group observed similar 
effects in animals engineered to lack the genes 
encoding YAP and TAZ, and/or in which MAL 
was inhibited — these three transcription fac- 
tors normally move to the nucleus to regulate 
gene expression in response to mechanical 
signals. Next, the authors examined YAP, TAZ
and MAL in mice lacking Diaph3 and Myh9. 
The transcription factors did not move to the 
nucleus in response to stretch in these animals. 
Thus, in normal skin, stretch reorganizes the 
actomyosin cytoskeleton to promote entry 
of YAP, TAZ and MAL into the nucleus. These 
proteins then coordinate transcriptional pro- 
grams that promote skin growth and barrier 
formation (Fig. 1). 

Finally, the researchers inhibited MAL or the 
EGF-pathway component ERK in their animals. 
Inhibition of either protein blocked stem-cell 
proliferation, but only MAL inhibition led to 
loss of the ‘stretched’ molecular state in a subset
of stem cells. Thus, MAL regulates variable
cell response to strain, whereas both ERK and
MAL are necessary to promote the self-renewal
of stem cells. Whether ERK is downstream of
YAP, TAZ and MAL, or directly activated by
the cytoskeleton, and whether EGF-ERK
signalling promotes adaptation to strain in
the upper epidermal layers to maintain barrier
function during skin expansion, remain open
questions. 

Overall, Aragona and colleagues’ data 
support a model in which stretch is initially
sensed by a subset of stem cells. These cells,
through cytoskeletal reorganization and 
changes in gene expression, coordinate 
stem-cell renewal and differentiation with 
adaptation to the mechanical force being 
experienced. This response guarantees that 
the skin can maintain its protective function 
while expanding. 

The study opens several avenues for
future research. First, what is the contribution
of other compartments of the skin (such as 
the upper, barrier-forming epidermal layers, 
or the thick dermal layer that underlies the 
epidermis) in sensing and communicating 
stretch? The authors’ analysis of mice lacking 
YAP or MAL suggests that stretch also induces 
a cytoskeletal response in differentiated cells
of the upper epidermal layers. Stem-cell differentiation
and upward movement can trigger
renewal of neighbouring stem cells, thus
raising the question of whether the cells
immediately above the basal layer are also 
required for stem-cell responses to strain. 

Second, in vitro experiments have demonstrated
that chromatin regulation in the
nucleus is key to maintaining stem-cell iden- 
tity and genome integrity under mechanical 
stress. Aragona and co-workers’ skin-expander 
model will now allow us to explore these 
mechanisms in vivo. 

Third, current models of stem-cell renewal 
postulate that a single stem cell is equally 
capable of undergoing renewal or differen- 
tiation. However, Aragona and colleagues’ 
lineage-tracing experiments revealed that 
the number of cells derived from one stem 
cell (called basal-cell clones) tended to be 
even. This bias towards clones that have even 
numbers of cells became much more pro- 
nounced on stretching. How the stretched 
state promotes even-numbered clones is 
unclear. 

The authors propose that this bias can be 
explained by a model in which stem cells exist 
in two-progenitor units, in which one stem 
cell is committed to renewal and the other to 
differentiation. Communication within and
between units would balance the loss of cells
through differentiation with renewal. The 
group performed a mathematical comparison, 
which indicated that the even-numbered-cell 
bias and clone dynamics they observed were 
more consistent with a two-progenitor than 
with a single-progenitor model. However, 
the jury on this is still out, because a recent 
study has provided fresh evidence for the 
one-progenitor model.

The current work provides a major step 
forward in our understanding of how force 
is interpreted at the single-cell level in living 
organisms. Furthermore, it should encour- 
age others to explore the use of mechanical 
signals to generate extra skin — not only for 
reconstructive surgery, but also for diseases 
associated with impaired regeneration.

Carbon loss from tropical
soils increases on warming 


Eric A. Davidson 


Plots of tropical forest soils were warmed by 4 °C for two years 
to observe the effects on soil carbon emissions. The increase 
in efflux of carbon dioxide was larger than expected — a result 
with worrying implications for climate change. See p.234 


The carbon stored in soil could have a big 
impact on climate change. The global flux of 
carbon in and out of soils is six to ten times 
greater than the emissions of carbon dioxide 
produced as a result of human activities. The
inputs of carbon to soils from plant detritus 
(dead wood, leaves and roots) roughly balance 
the losses to the atmosphere produced by the 
respiration of soil microorganisms that feed 
on that material. However, just a 1% imbalance
of global soil-carbon effluxes over influxes 
would equal about 10% of global anthropo- 
genic carbon emissions. Carbon in tropical 
soils was thought to be less vulnerable to loss 
under climate change than is soil carbon at 
higher latitudes, but experimental evidence 
for this was lacking. On page 234, Nottingham 
et al. report that tropical-forest soils might be
more vulnerable to warming than was thought. 
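The arithmetic behind the 1% imbalance claim can be sketched in a few lines. The only inputs are the six-to-ten-fold flux ratio and the 1% imbalance quoted above; the function and constant names are illustrative, and the result is expressed relative to anthropogenic emissions rather than in absolute units.

```python
# Gross soil carbon flux relative to anthropogenic CO2 emissions, as quoted in the text.
FLUX_RATIO_LOW, FLUX_RATIO_HIGH = 6.0, 10.0

def imbalance_as_share_of_emissions(flux_ratio, imbalance_fraction=0.01):
    """Net soil carbon loss expressed as a fraction of anthropogenic emissions."""
    return flux_ratio * imbalance_fraction

low = imbalance_as_share_of_emissions(FLUX_RATIO_LOW)
high = imbalance_as_share_of_emissions(FLUX_RATIO_HIGH)
print(f"A 1% imbalance equals {low:.0%} to {high:.0%} of anthropogenic emissions")
```

On these inputs, the "about 10%" figure corresponds to the upper end of the quoted flux range.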

Various soil management practices (such as
changes of land use and of tillage methods) can
influence the amount of soil carbon present,
but climate affects the respiration rate of the
microbes that feed on soil carbon, and hence
the CO₂ efflux from soil. If the net efflux of carbon
from soils to the atmosphere increases
in a warming world, a positive feedback will
accelerate the warming.

Soils are amazingly diverse and differ in
several respects that affect microbial respiration.
These include: the amount of minerals
to which soil carbon can bind and thereby be
protected from decomposition by microbial
enzymes; water content, which affects the
diffusion of carbon to microbial enzymes;
the amount, timing and quality of the plant
detritus going into the soil; and genetic variation
in soil microbial communities. The
intrinsic temperature sensitivity of microbial
respiration reactions indicated by theory and
demonstrated in the laboratory (where other
factors are not limiting) therefore often varies
from the apparent temperature sensitivity
measured in real-world settings. Several
in situ soil-warming experiments have yielded
insights into the effects of temperature on
CO₂ efflux from soil in temperate and boreal
regions, but such research is logistically more
challenging to implement in tropical forests.

Nottingham and co-workers now present
results of a soil-warming experiment (Fig. 1).
The authors placed warming rods around the
perimeter of undisturbed soil plots in a tropical
forest on Barro Colorado Island, Panama,
and increased the temperature of the whole
soil profile (to a depth of 1.2 metres) by 4 °C
for two years. They measured CO₂ efflux using
chambers periodically placed over the soil,
and observed an unexpectedly large increase
(55%) in soil CO₂ emissions. By excluding roots
from the soil under some of the chambers, the
authors determined that most of the increased
CO₂ efflux was due to a greater-than-expected
increase in the respiration of soil microbes.

Figure 1 | Preparing to use a heating device in an experiment. Nottingham et al. buried heating
equipment in plots in a tropical forest in Panama and monitored the effect of warming on carbon dioxide
release from the soil.

The dependence of reaction rates on
temperature, including the rates of enzymatic
reactions, is described by the Arrhenius
equation. According to this equation, the
fractional increase in reaction rate is less for
a temperature increase of one degree Celsius
at higher temperatures than at lower temperatures.
This suggests that the response of soil
microbial respiration to temperature changes
should be less pronounced in tropical climates
than it is in temperate or boreal climates.
The Arrhenius relationship breaks down for
enzymatic reactions at temperatures beyond
those that are optimal for enzyme function,
and therefore should not be extrapolated
beyond temperatures that enzymes normally
experience. But such extreme temperatures
were not reached in Nottingham and
colleagues’ experiment.
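The statement that the fractional sensitivity falls as temperature rises can be checked by differentiating the Arrhenius equation. The sketch below uses the standard textbook form, with symbols that the article does not define: k is the reaction rate, A a pre-exponential factor, E_a the activation energy, R the gas constant and T the absolute temperature.

```latex
k(T) = A \exp\!\left(-\frac{E_a}{R T}\right)
\quad\Longrightarrow\quad
\frac{1}{k}\,\frac{\mathrm{d}k}{\mathrm{d}T} = \frac{E_a}{R T^{2}}
```

Because the right-hand side scales as 1/T², the same one-degree warming produces a smaller fractional rate increase at a warmer starting temperature, provided E_a stays fixed.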

Each enzyme-catalysed reaction has a
characteristic activation energy (Eₐ, simplistically
defined as the minimum energy needed
for the reaction to occur), but the Eₐ values of
the vast majority of respiratory enzymes fall
within a fairly narrow range, averaging about
60 kilojoules per mol. If this common Eₐ value
is used in an Arrhenius function to calculate
how the average microbial CO₂ efflux from
Nottingham and colleagues’ plots varies with
temperature (fitting the resulting curve to the
efflux recorded in the non-warmed plots),
the observed efflux from the warmed plots is
found to be much higher than the calculated
value (Fig. 2a). The Eₐ value would have to be
an unlikely 97 kJ mol⁻¹ to make the Arrhenius
function fit with the efflux observed from the
warmed plots.
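The comparison between the two activation energies can be reproduced approximately with a short calculation. This is a minimal sketch and not the authors' fitting procedure: it assumes the standard Arrhenius form, takes the ambient and warmed temperatures (26 °C and 30 °C) from the Figure 2 caption, and uses illustrative function names.

```python
import math

R_GAS = 8.314  # molar gas constant, J mol^-1 K^-1

def arrhenius_ratio(ea_kj_per_mol, t_cold_c, t_warm_c):
    """Rate at t_warm_c divided by rate at t_cold_c for activation energy Ea."""
    t1, t2 = t_cold_c + 273.15, t_warm_c + 273.15
    return math.exp(ea_kj_per_mol * 1000.0 / R_GAS * (1.0 / t1 - 1.0 / t2))

def activation_energy_for_ratio(ratio, t_cold_c, t_warm_c):
    """Invert the Arrhenius ratio: Ea (kJ mol^-1) needed to produce `ratio`."""
    t1, t2 = t_cold_c + 273.15, t_warm_c + 273.15
    return R_GAS * math.log(ratio) / (1.0 / t1 - 1.0 / t2) / 1000.0

for ea in (60.0, 97.0):
    ratio = arrhenius_ratio(ea, 26.0, 30.0)
    print(f"Ea = {ea:.0f} kJ/mol -> warmed/ambient efflux ratio {ratio:.2f}")
```

Under these assumptions, the typical 60 kJ mol⁻¹ gives an increase of a little under 40% for 4 °C of warming, whereas 97 kJ mol⁻¹ gives an increase of roughly two-thirds.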

A short-hand indicator of the temperature
sensitivity of soil CO₂ efflux is Q₁₀, the multiplication
factor of the increase observed when
the temperature increases by 10 °C. The Q₁₀
for microbial CO₂ efflux in Nottingham and
co-workers’ experiment was 3.6, which is
well above expectations for most respiratory
enzymatic processes on the basis of common
Eₐ values and the Arrhenius equation (Fig. 2b).
Temperature sensitivities as high as this indicate
that the increased efflux in the warmed
plots probably cannot be attributed only to
the kinetic response of enzymes to warming;
other confounding factors must also co-vary
with temperature. Indeed, in in vitro assays,
Nottingham et al. report Q₁₀ values of 2 or
less for specific respiratory enzymes in soil
samples from the studied plots. The authors
rule out the possibility that significant confounding
effects arose during their warming
experiment as a result of the modest changes
in soil-water content and microbial carbon-use
efficiency. The cause of the unexpectedly high
observed apparent temperature sensitivity
therefore remains a mystery.
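The link between Q₁₀ and the Arrhenius function can be sketched as follows. The code assumes that Q₁₀ is evaluated between T and T + 10 °C (other conventions exist) and that Eₐ is the typical 60 kJ mol⁻¹ quoted in the article; the function names are illustrative only.

```python
import math

R_GAS = 8.314  # molar gas constant, J mol^-1 K^-1

def q10_from_ratio(rate_ratio, delta_t_c):
    """Q10 implied by a rate ratio observed over a warming of delta_t_c degrees."""
    return rate_ratio ** (10.0 / delta_t_c)

def ratio_from_q10(q10, delta_t_c):
    """Rate ratio expected over delta_t_c degrees of warming for a given Q10."""
    return q10 ** (delta_t_c / 10.0)

def q10_from_arrhenius(ea_kj_per_mol, temp_c):
    """Q10 between temp_c and temp_c + 10 predicted by an Arrhenius function."""
    t = temp_c + 273.15
    return math.exp(ea_kj_per_mol * 1000.0 / R_GAS * 10.0 / (t * (t + 10.0)))

print(f"Arrhenius Q10 at 0 C:  {q10_from_arrhenius(60.0, 0.0):.2f}")
print(f"Arrhenius Q10 at 26 C: {q10_from_arrhenius(60.0, 26.0):.2f}")
print(f"Efflux ratio over 4 C for Q10 = 3.6: {ratio_from_q10(3.6, 4.0):.2f}")
```

With these assumptions, the Arrhenius-derived Q₁₀ at 26 °C comes out close to 2.2, well below the observed 3.6.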

These results suggest that caution is needed
when using only an Arrhenius or Q₁₀ function
to describe the temperature responses of soil
microbial respiration, which most numerical
models of Earth’s systems and their carbon
cycling responses to climate change unfortunately
do. The Arrhenius function (where Eₐ is
about 60 kJ mol⁻¹) predicts a modest decrease
in Q₁₀ from about 2.6 at temperatures typical
of high-latitude ecosystems to 2.2 at temperatures
typical of tropical forests (Fig. 2b),
whereas the current study and others
report a broader range of observed Q₁₀ values.

Figure 2 | Theoretical and observed temperature sensitivity of CO₂ efflux from tropical soils.
a, Nottingham et al. observed an increase in CO₂ efflux from tropical soils warmed in situ from an ambient
temperature of 26 °C to 30 °C (dark blue circles). The pale blue line shows the theoretical dependence
of CO₂ efflux associated with soil microbial respiration, calculated using an Arrhenius function (which
describes the kinetic response of enzymes to temperature) and assuming that the activation energy (Eₐ;
simplistically, the minimum amount of energy needed for a reaction to occur) of the microbial respiratory
reactions is 60 kilojoules per mol, which is typical for such reactions. The observed increase in efflux on
warming is larger than that predicted from this Arrhenius function. To fit the observations, Eₐ would have
to be 97 kJ mol⁻¹ (dark blue line), which is unlikely for respiratory enzymes. b, The Arrhenius function also
predicts much lower Q₁₀ values (red line; the multiplication factor of the increase in efflux observed when
the temperature increases by 10 °C) than that observed by the authors at 30 °C (red circle). The authors’
results suggest that unknown factors co-vary with temperature to increase CO₂ efflux, and that the kinetic
responses of respiratory enzymes, alone, are insufficient to account for their observations.

Nottingham and colleagues’ results suggest
that other factors affecting soil CO₂
efflux merit further study. Efflux responses
to experimental warming often decline after
a few years, perhaps because the readily
decomposable components of soil carbon
that act as substrates for respiration reactions
become depleted, or because of shifts
in microbial community structures. The
Panamanian forest soil studied in the present
work is relatively fertile as a result of the
historical deposition of volcanic ash; this
soil fertility might have enabled high rates of
inputs of plant detritus to the soil, preventing
soil microbial respiration from being limited
by the availability of substrates. Indeed, the
authors report an increase in soil microbial
biomass in the warmed plots compared with
control plots, indicating that warming might
have increased the total microbial enzymatic
capacity, thereby increasing CO₂ efflux
beyond the expected kinetic response of the
pre-existing enzymatic capacity.

Besides the clear take-home message that
the responses of soil respiration processes
under climate change should not be represented
in Earth-system models only by
simple Q₁₀ or Arrhenius functions, Nottingham
and co-workers’ study adds to recently
accumulating evidence that tropical forests
are unlikely to continue indefinitely to be carbon
sinks as the world warms. Tropical soil
carbon does not receive as much attention as
do the large and vulnerable soil-carbon stocks
at high latitudes, which pose major concerns
as a potential source of positive feedback to
climate change. But tropical-forest soils also
contain substantial carbon stores that might
be more vulnerable to loss in a warming world
than was previously recognized.

© 2020 Springer Nature Limited. All rights reserved.

The extreme astrophysical processes and conditions that characterize the early
Universe are expected to result in young galaxies that are dynamically different from
those observed today. This is because the strong effects associated with galaxy
mergers and supernova explosions would lead to most young star-forming galaxies
being dynamically hot, chaotic and strongly unstable. Here we report the presence
of a dynamically cold, but highly star-forming, rotating disk in a galaxy at redshift
z = 4.2, when the Universe was just 1.4 billion years old. Galaxy SPT-S J041839-4751.9 is
strongly gravitationally lensed by a foreground galaxy at z = 0.263, and it is a typical
dusty starburst, with global star-forming and dust properties that are in agreement
with current numerical simulations and observations. Interferometric imaging at a
spatial resolution of about 60 parsecs reveals a ratio of rotational to random motions
of 9.7 ± 0.4, which is at least four times larger than that expected from any galaxy
evolution model at this epoch but similar to the ratios of spiral galaxies in the local
Universe. We derive a rotation curve with the typical shape of nearby massive spiral
galaxies, which demonstrates that at least some young galaxies are dynamically akin
to those observed in the local Universe, and only weakly affected by extreme physical
processes.


SPT-S J041839-4751.9 (hereafter SPT0418-47) was observed with the
Atacama Large Millimeter/submillimetre Array (ALMA) to image the
thermal continuum emission from the galaxy’s dust and the emission
from the 158-μm fine-structure line of ionized carbon [C II] (Fig. 1a).
The [C II] line is typically the brightest fine-structure line emitted in
star-forming galaxies, and it is the main gas coolant, coming from multiple
phases of the interstellar medium, including the warm ionized, the
warm and cold neutral atomic, and the dense molecular medium. We
applied a three-dimensional (3D) lens-kinematic modelling technique
to the interferometric data cube containing the [C II] line to determine
the gas surface brightness distribution in each spectral channel (Figs. 1, 2,
Extended Data Figs. 1, 2, Extended Data Table 1; see Methods for full
discussion), from which we derived the gas kinematics, and performed
a robust dynamical analysis of the galaxy. In addition, we reconstructed
the far-infrared surface brightness distribution of the heated dust
emission from the interferometric continuum dataset.

We find that the rotation curve of SPT0418-47 has the typical shape
of a bulge-dominated spiral galaxy in the local Universe; it has a bump
at 0.2 kpc from the galaxy centre and then declines before flattening
at radii greater than 1.5 kpc (Fig. 2a, c). The [C II] velocity dispersion σ
is well described by an exponential profile, with average velocities of
~45 km s⁻¹ in the inner regions (<1 kpc) and ~18 km s⁻¹ in the outer disk
(≥1 kpc) (Fig. 2b, d, Extended Data Table 1). From a decomposition of
the rotation curve, we derive the relative contribution of the different
mass components (the stellar component, the gaseous disk and the
dark-matter halo) to the total galactic gravitational potential (Extended
Data Table 5, Fig. 2c; see Methods).


SPT0418-47 has global physical properties (total stellar mass
M⋆ ≈ 1.2 × 10^10 M⊙, dark-matter mass M_DM ≈ 1.7 × 10^12 M⊙, gas
fraction f_gas ≈ 0.53 and stellar effective radius Rₑ ≈ 0.22 kpc;
uncertainties on these values are the 16th and 84th percentile ranges of the
probability distribution function of each parameter; M⊙ is the mass of the Sun;
see Extended Data Table 5 for further parameters) that are in agreement
with the predictions from the most recent theoretical models, as well
as observations of the population of starburst galaxies at this redshift.
From the delensed dust emission, we derive an intrinsic infrared luminosity
of (2.4 ± 0.4) × 10^12 L⊙ (L⊙, luminosity of the Sun), a star formation
rate (SFR) of (352 ± 65) M⊙ yr⁻¹ and a gas-depletion timescale of 38 ± 9 Myr,
which are all typical of millimetre-selected starburst galaxies at this
redshift (see Methods).
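The quoted global properties can be cross-checked against one another with a rough calculation. This sketch rests on two assumptions that the text does not state: that the gas fraction is defined as M_gas / (M_gas + M_star), and that the gas-depletion timescale is M_gas / SFR. Only the central values quoted above are used, and the function names are illustrative.

```python
def gas_mass_from_fraction(stellar_mass_msun, gas_fraction):
    """Gas mass in solar masses, assuming f_gas = M_gas / (M_gas + M_star)."""
    return stellar_mass_msun * gas_fraction / (1.0 - gas_fraction)

def depletion_time_myr(gas_mass_msun, sfr_msun_per_yr):
    """Gas-depletion timescale in Myr, assuming t_dep = M_gas / SFR."""
    return gas_mass_msun / sfr_msun_per_yr / 1.0e6

m_gas = gas_mass_from_fraction(1.2e10, 0.53)  # central stellar mass and gas fraction
t_dep = depletion_time_myr(m_gas, 352.0)      # central star formation rate

print(f"implied gas mass: {m_gas:.2e} solar masses")
print(f"implied depletion time: {t_dep:.0f} Myr")
```

Under these definitions the implied depletion time falls within the quoted 38 ± 9 Myr, which suggests that the central values are mutually consistent.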

Our high-resolution 3D kinematic analysis shows that SPT0418-47
has a ratio of rotational velocity (V) to velocity dispersion σ of
V/σ = 9.7 ± 0.4 (Extended Data Table 2), which is similar to that of spiral
galaxies in the local Universe, but considerably larger than what is
predicted by most numerical and analytical models. For example,
for the majority of star-forming galaxies at z = 4 with stellar masses
in the range 10^9 M⊙ to 10^10.5 M⊙, the most recent cosmological
magnetohydrodynamical simulation TNG50 gives V/σ < 3 (light-blue band in
Fig. 3); even though such galaxies have rotationally supported gas-rich
disks, they are dynamically hotter than their low-redshift counterparts.
Complex astrophysical processes, such as stellar feedback by supernovae
or radiation pressure, active galactic nucleus (AGN) feedback
from the central supermassive black hole, galaxy mergers, and gas
inflows and outflows, are expected to have a substantial effect on the

Fig. 1 | [C II] emission from the lensed galaxy SPT0418-47 and source plane
reconstruction. a, Emission of the 158-μm fine-structure line of ionized
carbon [C II] integrated across a velocity range of 721 km s⁻¹ (zeroth-moment
map). The beam size, shown as a white ellipse on the lower left corner, is
0.19″ × 0.17″ at a position angle of −85.22°. b, c, As in a, but the emission is
colour-coded by the flux-weighted velocity (b) and velocity dispersion (c) (first-
and second-moment maps). d–f, Zeroth-, first- and second-moment maps of the
reconstructed source. In d, the white contours are set at n = 0.05, 0.1, 0.2, 0.4,
0.6, 0.8 times the value of the maximum flux of the zeroth-moment map. In e,
the black solid contours are at V_sys ± ΔV, where V_sys = 0 km s⁻¹ is the systemic
velocity and ΔV = 40 km s⁻¹. We note that the scales of the second-moment
maps (c and f) are different because c shows the observed values, whereas f
shows the intrinsic ones (beam-smearing corrected). These six maps are
intended only for visualization; the source reconstruction and its kinematic
modelling are performed using the full 3D information of the data cube
containing the [C II] emission line (see Methods and Extended Data Fig. 1 for
further details). dec., declination; RA, right ascension.


Fig. 2 | Kinematic and dynamical properties of […] obtained by error
propagation from the 1 s.d. uncertainties of the parameters that define the
respective profiles (see Methods and Extended Data Table 1).

Fig. 3 | Comparison between SPT0418-47 and samples of observed and
simulated galaxies. Mean ratios of the rotational to random motion
($V/\sigma$) of the gas versus redshift for the comparison samples of
observed star-forming galaxies indicated in the key and for SPT0418-47 (red
square and empty red circle). The gas tracers are H I, Hα, [O II], [O III]
and [C I]. For SPT0418-47, $V/\sigma$ was obtained from the [C II] emission
line. For the comparison samples, the shaded regions show the area between
the 16th and 84th percentiles of the distributions, and the horizontal bars
show the median values (where available). In Extended Data Table 3 we show
the different extraction methods used to compute $V/\sigma$. For the empty
markers, the $V/\sigma$ values were calculated using for each galaxy a proxy
for the maximum rotation velocity, $V_{\rm max}$. The $V/\sigma$ values
shown by the full markers were calculated using the flat or the outer part
of the rotation curve, $V_{\rm flat}$. The violet cross is a lower limit for
a single galaxy. The light-blue band shows $V/\sigma$ for
simulated galaxies from Illustris TNG5O' in the mass range 10°M,-10"M,. For 
these simulated galaxies, $V/\sigma$ is calculated as the ratio between the
maximum rotation velocity, $V_{\rm max}$, and the mean velocity dispersion,
$\sigma_{\rm m}$. The grey area shows the expected $V/\sigma$ for galaxies
dominated by violent disk instabilities. The black and green areas show a
prediction and an assumption, respectively, from two different analytical
models (Analytic 1 and Analytic 2). The red square is the
$V_{\rm max}/\sigma_{\rm m}$ for SPT0418-47, and the empty red circle shows
the position of the $V/\sigma$ value for SPT0418-47, calculated as the ratio
between $V_{\rm flat}$ and the velocity dispersion at outer radii,
$\sigma_{\rm ext}$ (see Extended Data Table 2). The error bars for the red
square and empty red circle show the 1 s.d. uncertainties.


gas kinematics within galaxies at this early epoch, and are predicted to
be responsible for a progressive increase of chaotic, random motions
with redshift. Our finding firmly rules out models in which a high
star-formation feedback and a high gas fraction necessarily produce
large turbulent motions (light-blue, green and black bands in Fig. 3)
and violent disk instabilities, resulting in dispersion-dominated systems
with $V/\sigma < 2$ at these redshifts (grey band in Fig. 3; see Methods
for further discussion about these comparisons). Our result requires
galaxy evolution models to produce dynamically cold galaxies that are
not characterized by large turbulent motions and violent instabilities,
even at early times.

Our kinematic analysis also allows the level of axisymmetric disk
instabilities to be measured within SPT0418-47 via the Toomre parameter
$Q$: a value of $Q > 1$ ensures that no instabilities will develop because
large-scale collapse is prevented by differential rotation, whereas $Q < 1$
indicates that instabilities will be able to grow and lead to the creation
of gas and star-formation clumps within the disk. For SPT0418-47,
we find an average value of $Q = 0.97 \pm 0.06$ at a distance of $R > 1$ kpc
(Fig. 2d), where the gas component is dominant (Fig. 2c), indicating a
potentially unstable disk, prone to form clumpy star-forming regions.
This result supports the hypothesis that the irregular morphologies
found for dusty starburst galaxies in the optical/ultraviolet rest-frame
wavelengths are poorly resolved clumpy star-forming regions, and
not objects that are undergoing or have recently experienced a merging
event.

Using our rotation-curve decomposition (see Methods), we find that
the stellar component of SPT0418-47 is well described by a Sérsic profile
with a Sérsic index of 2.2 and a stellar mass of
$1.2 \times 10^{10}\,M_\odot$ (see Extended Data Table 5 for the
uncertainties).
Several observational studies of the structural properties of galaxies
have confirmed that the Hubble sequence is already in place at $z = 2.5$,
with galaxies showing a large variety of morphologies. However, at
$z \gtrsim 3$, the lack of spatially resolved data in the rest-frame optical
emission for these galaxies has prevented the study of their structures and
morphologies. The unprecedented spatial resolution of 60 pc of the
dataset for SPT0418-47 allows one to study the morphological properties
of a $z = 4$ galaxy. The bump in the inner region of the rotation
curve clearly indicates that a bulge is already in place at $z = 4$, whereas
the Sérsic index of ~2 is a signature of either a disky bulge or an
embedded disk-like component.

Dusty starburst galaxies are believed to be the progenitors of
early-type galaxies (ETGs), which are the most massive galaxies
observed today, dominated by old stellar populations. The most popular
evolutionary track for this transformation predicts that the
dusty starburst phase is followed by a quenching phase, during which
AGN feedback leads to gas consumption and heating, with the consequent
formation of a population of compact quiescent galaxies at
$z = 2$. In the final phase, minor dry mergers are expected to be
responsible for a growth in galaxy size and the transformation into
present-day ETGs. In Fig. 4a–d we compare the main physical quantities of
SPT0418-47, as inferred from our dynamical model, with the corresponding
quantities for the sample of local ETGs from the ATLAS$^{\rm 3D}$ survey. We
consider only those local ETGs with estimated stellar ages of
$\gtrsim 12$ Gyr, which is the look-back time corresponding to $z = 4$. The
orange thin diamond in each panel shows the values derived for SPT0418-47
as observed today, and the red diamond shows the corresponding
baryonic values (gas plus stars), under the assumption that all the gas
that we observe today will be converted into stars, preserving the disk
configuration. Given the SFR estimated for SPT0418-47, this conversion
will happen in ~38 Myr (see Methods). The comparison between
the ETGs and the stellar/baryonic quantities for SPT0418-47 in the
size–mass plane (Fig. 4a) indicates that this starburst galaxy should
increase its stellar mass by a factor of 6 (3 for the red diamond) and its
effective radius by a factor of 11 (3 for the red diamond), in order to
evolve into an average ETG (yellow cross). This is in agreement with a
simple toy model for mergers, in which a single major dry merger
event would be responsible for an increase in both the size and the
stellar mass of SPT0418-47 by a factor of 3. In Fig. 4a, the grey stars
show the populations of compact galaxies at $2 < z < 3$. Notably, the
median size for this population is comparable to the size of SPT0418-47
(red diamond), but its median stellar mass is larger by a factor of 3.
In addition, the position of SPT0418-47 on the size–mass plane is
compatible with the low-mass end of both ETGs and compact $z = 2$ galaxies.
This implies that either this galaxy will evolve smoothly into a low-mass
ETG after the consumption and/or heating of the cold gas reservoir or,
as predicted by the merger scenario, it will reach the bulk of the ETG
population in the size–mass plane. Figure 4c shows that a potential
progenitor of an ETG already has a disky stellar component at $z = 4$, a
feature that is very common especially amongst local fast-rotator
ETGs. Finally, we derived the fraction of dark-matter mass within the
effective radius and found that, with a value of
$f_{\rm DM}(R < R_{\rm e}) = 0.095$ (red diamond), the central regions of
SPT0418-47 are dominated by baryons. As shown in Fig. 4c, such a low
fraction of dark matter is compatible with observations of local ETGs,
implying that the physical mechanisms responsible for the mass and size
growth of this galaxy with cosmic time should preserve the dark-matter
contribution within the innermost ~1 kpc.

Methods


Observations 

SPT0418-47 was identified during the South Pole Telescope Survey
as a far-infrared luminous source with dust-like spectral indices at
1.4 mm and 2.0 mm. Follow-up observations with ALMA confirmed
SPT0418-47 to be a dusty starburst galaxy at redshift $z = 4.2248$
that is strongly gravitationally lensed by a foreground galaxy
at $z = 0.263$.

The data used for the analysis described here were taken from the
ALMA Archive. SPT0418-47 was observed with ALMA on 5 July 2017
under project code 2016.1.01499.S (Principal investigator, K. Litke).
Observations were taken at a central frequency of 358 GHz with an
antenna configuration with a maximum baseline of 1,400 m. The data
were correlated with both linear polarizations (XX and YY), with a
visibility integration time of 6 s, in four spectral windows, each with
240 channels and 1.875 GHz bandwidth. The spectral windows were centred on
350.8, 352.7, 362.8 and 364.6 GHz, and the last spectral window covers
the redshifted [C II] line (rest frequency $\nu_{\rm rest} = 1{,}900.5369$ GHz).
J0519-4546 was observed to calibrate the flux density scale and
J0522-3627 was used to correct the spectral bandpass. J0428-5005 was
observed as a secondary check source. Phase switching to J0439-4522
was carried out at 7-min intervals to calibrate the complex gains
resulting from atmosphere-induced phase and amplitude fluctuations. The
total on-source integration time was 21 min.

We calibrated the raw visibility data using the ALMA pipeline in the
Common Astronomy Software Applications (CASA) package. The data
were then inspected to confirm the quality of the pipeline calibration
and that no further flagging was required. Phase-only self-calibration
was performed on the continuum with a solution interval of 100 s to
correct for residual phase errors. The complex gain corrections from
the continuum were applied to the line spectral window. The line data
were prepared by fitting a model to the line-free spectral windows
and subtracting it from the visibilities, to produce a dataset with only
the spectral line emission. To improve the signal-to-noise ratio, we
averaged the data into groups of four velocity channels, resulting in 28
independent channels, each with a width of 25.7 km s$^{-1}$.
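
As a quick consistency check on the numbers quoted above, the following
Python sketch computes where the [C II] line falls for $z = 4.2248$ and the
velocity width of a four-channel average. It assumes evenly spaced channels
across the 1.875 GHz window and the approximation $\Delta v = c\,\Delta\nu/\nu_{\rm obs}$;
the function names are illustrative and are not part of the ALMA or CASA
tooling.

```python
C_KMS = 299_792.458        # speed of light in km/s
NU_REST_GHZ = 1900.5369    # [C II] rest frequency quoted in the text
Z_SOURCE = 4.2248          # redshift of SPT0418-47 quoted in the text


def observed_frequency_ghz(nu_rest_ghz: float, z: float) -> float:
    """Redshifted line frequency, nu_obs = nu_rest / (1 + z)."""
    return nu_rest_ghz / (1.0 + z)


def channel_width_kms(bandwidth_ghz: float, n_channels: int,
                      nu_obs_ghz: float, n_averaged: int = 1) -> float:
    """Velocity width of n_averaged adjacent channels, dv = c * dnu / nu_obs.

    Assumes evenly spaced channels across the spectral window.
    """
    dnu_ghz = bandwidth_ghz / n_channels
    return C_KMS * n_averaged * dnu_ghz / nu_obs_ghz


nu_obs = observed_frequency_ghz(NU_REST_GHZ, Z_SOURCE)
# The last spectral window is centred on 364.6 GHz and is 1.875 GHz wide.
in_window = abs(nu_obs - 364.6) <= 1.875 / 2
dv = channel_width_kms(1.875, 240, nu_obs, n_averaged=4)

print(f"nu_obs = {nu_obs:.2f} GHz, inside last window: {in_window}")
print(f"4-channel width = {dv:.2f} km/s; 28 channels span {28 * dv:.0f} km/s")
```

Under these assumptions the line lands inside the window centred on
364.6 GHz, and 28 averaged channels span the 721 km s$^{-1}$ range over which
the zeroth-moment map is integrated.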

SPT0418-47 was imaged on a pixel scale of 0.03 arcsec per pixel with
natural weighting of the visibilities, and deconvolved using CLEAN.
The zeroth-, first- and second-moment maps of the [C II] emission are
shown in Fig. 1a–c; we note that these images are intended only for
visualization, as all the modelling and analysis were performed on the
visibility data directly. Throughout the manuscript, we assume a ΛCDM
cosmology, with Hubble constant $H_0 = 67.8$ km s$^{-1}$ Mpc$^{-1}$, matter
density $\Omega_{\rm m} = 0.308$ and vacuum energy density
$\Omega_\Lambda = 0.691$.
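
The adopted cosmology fixes the conversion between redshift and cosmic
time. As an illustrative sketch (not part of the original analysis), the
look-back time can be obtained by integrating
$t_{\rm L} = H_0^{-1}\int_0^z {\rm d}z' / [(1+z')E(z')]$ numerically. The
code below neglects radiation and assigns any small residual in the density
budget to curvature; both are assumptions of this example.

```python
import math

H0 = 67.8          # km/s/Mpc, from the text
OMEGA_M = 0.308    # matter density, from the text
OMEGA_L = 0.691    # vacuum energy density, from the text
KM_PER_MPC = 3.0856775814913673e19
SEC_PER_GYR = 3.15576e16


def e_of_z(z: float) -> float:
    """Dimensionless Hubble rate E(z) = H(z) / H0 (radiation neglected)."""
    omega_k = 1.0 - OMEGA_M - OMEGA_L
    return math.sqrt(OMEGA_M * (1 + z) ** 3 + omega_k * (1 + z) ** 2 + OMEGA_L)


def lookback_time_gyr(z: float, n_steps: int = 2000) -> float:
    """Look-back time in Gyr, integrated with Simpson's rule (n_steps even)."""
    hubble_time_gyr = KM_PER_MPC / H0 / SEC_PER_GYR

    def integrand(x: float) -> float:
        return 1.0 / ((1.0 + x) * e_of_z(x))

    h = z / n_steps
    total = integrand(0.0) + integrand(z)
    for i in range(1, n_steps):
        total += (4 if i % 2 else 2) * integrand(i * h)
    return hubble_time_gyr * total * h / 3.0


print(f"look-back time to z = 4: {lookback_time_gyr(4.0):.1f} Gyr")
```

For these parameters the look-back time to $z = 4$ comes out slightly above
12 Gyr, in line with the age cut applied to the local ETG comparison sample.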


Lens and kinematic model 

We model the data cube containing the [C II] emission line using a 3D
Bayesian lens–kinematic modelling technique that fits the data directly
in visibility space (Extended Data Fig. 1). See Rizzo et al. and Powell
et al. for a more detailed description of the modelling approach, which
is briefly summarized below.

The method used for source reconstruction is grid-based: the background
source surface brightness distribution is reconstructed on
a triangular adaptive grid defined by a Delaunay tessellation. The
source grid automatically adapts with the lensing magnification, so
that there is a high pixel density in the high-magnification regions,
resulting in a spatial resolution of ~40 pc in the inner regions and ~90 pc
in the outer regions for this system. The kinematics of the background
galaxy are obtained by fitting the lensed data directly in a hierarchical
Bayesian fashion, where a 3D kinematic model, $\mathbf{s}_{\rm kin}$, that
describes a rotating disk is used as a regularizing prior for a pixellated
source reconstruction.

Taking advantage of the fact that gravitational lensing conserves
surface brightness, and considering the observational noise $\mathbf{n}$,
the data in the visibility space $\mathbf{d}$ and the source surface
brightness distribution $\mathbf{s}$ can be related to each other via the
following set of linear equations:

$$\mathbf{D}\,\mathbf{L}(\boldsymbol{\eta}_{\rm lens})\,\mathbf{s} + \mathbf{n} = \mathbf{d}, \qquad (1)$$

where $\mathbf{L}$ is the lensing operator, which is a function of the
lensing parameters $\boldsymbol{\eta}_{\rm lens}$, and $\mathbf{D}$ is the
non-uniform discrete Fourier transform operator. Because both
$\boldsymbol{\eta}_{\rm lens}$ and $\mathbf{s}$ are unknown, equation (1)
cannot be simply inverted. We therefore fit the data within the framework
of Bayesian statistics by using two levels of inference.

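At the first level of inference, it can be shown that given the data, a
lens–mass model and a source kinematic model, the most probable
a posteriori source $\mathbf{s}_{\rm MP}$ is obtained by solving the
following set of linear equations:

$$[(\mathbf{D}\mathbf{L})^{T}\mathbf{C}^{-1}\mathbf{D}\mathbf{L} + \lambda\,\mathbf{R}^{T}\mathbf{R}]\,\mathbf{s}_{\rm MP} = (\mathbf{D}\mathbf{L})^{T}\mathbf{C}^{-1}\mathbf{d} + \lambda\,\mathbf{R}^{T}\mathbf{R}\,\mathbf{s}_{\rm kin}, \qquad (2)$$

where $\mathbf{C}$ is the noise covariance and $\mathbf{R}$ is the source
regularization form with regularization strength $\lambda$.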

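At the second level of inference, the lens parameters
$\boldsymbol{\eta}_{\rm lens}$, the regularization strength $\lambda$ and the
kinematic parameters $\boldsymbol{\eta}_{\rm kin}$ defining
$\mathbf{s}_{\rm kin}$ are obtained by maximizing the posterior probability
distribution $P(\mathbf{d}\,|\,\boldsymbol{\eta}_{\rm lens}, \boldsymbol{\eta}_{\rm kin}, \lambda)$
defined as:

$$\begin{aligned}
2\log P(\mathbf{d}\,|\,\boldsymbol{\eta}_{\rm lens}, \boldsymbol{\eta}_{\rm kin}, \lambda) = {} & -\chi^{2} - \lambda\,(\mathbf{s}_{\rm MP} - \mathbf{s}_{\rm kin})^{T}\mathbf{R}^{T}\mathbf{R}\,(\mathbf{s}_{\rm MP} - \mathbf{s}_{\rm kin}) - \log(\det\mathbf{A}) \\
& + \log[\det(\lambda\,\mathbf{R}^{T}\mathbf{R})] + \log[\det(2\pi\,\mathbf{C}^{-1})].
\end{aligned} \qquad (3)$$

In equation (3),
$\chi^{2} = (\mathbf{D}\mathbf{L}\mathbf{s}_{\rm MP} - \mathbf{d})^{T}\mathbf{C}^{-1}(\mathbf{D}\mathbf{L}\mathbf{s}_{\rm MP} - \mathbf{d})$
and the matrix $\mathbf{A}$ is defined as
$\mathbf{A} = (\mathbf{D}\mathbf{L})^{T}\mathbf{C}^{-1}\mathbf{D}\mathbf{L} + \lambda\,\mathbf{R}^{T}\mathbf{R}$.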

Because of the large number of visibilities per channel
($N_{\rm vis} = 163{,}190$), we replace the non-uniform discrete Fourier
transform operator $\mathbf{D}$ with a non-uniform fast Fourier transform
(FFT) operator, which first uses a gridding kernel to interpolate the
visibilities onto a regular gridded u-v plane, then applies an FFT, and
finally uses an apodization correction to remap the model and the data onto
the original u-v sampling. Because the non-uniform FFT does not have an
explicit matrix representation, the source inversion (equation (2)) is
solved using a
preconditioned conjugate gradient solver. We note that for the con- 
jugate gradient method we use a tolerance of 10”, which results in 
typical accuracies of 10° for the source inversions. 
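
To illustrate how a regularized source inversion can be solved when the
forward operator is available only as a function, the sketch below applies a
Jacobi-preconditioned conjugate gradient iteration to a toy problem. It is a
minimal example under stated assumptions, not the solver used in the
analysis: the small matrix `M` stands in for the combined Fourier and lensing
operator, the noise is white, the regularization form is the identity, and
the tolerance is an arbitrary illustrative value. All names are
hypothetical.

```python
import math
from typing import Callable, List

Vector = List[float]


def dot(u, v) -> float:
    return sum(a * b for a, b in zip(u, v))


def pcg(apply_a: Callable[[Vector], Vector], b: Vector, inv_diag: Vector,
        tol: float = 1e-10, max_iter: int = 500) -> Vector:
    """Solve A x = b for symmetric positive-definite A, given only x -> A x.

    inv_diag holds 1 / diag(A) and acts as a Jacobi preconditioner.
    """
    x = [0.0] * len(b)
    r = list(b)                                  # residual b - A x at x = 0
    z = [m * ri for m, ri in zip(inv_diag, r)]   # preconditioned residual
    p = list(z)
    rz = dot(r, z)
    b_norm = math.sqrt(dot(b, b))
    for _ in range(max_iter):
        if math.sqrt(dot(r, r)) <= tol * b_norm:
            break
        ap = apply_a(p)
        alpha = rz / dot(p, ap)
        x = [xi + alpha * pi for xi, pi in zip(x, p)]
        r = [ri - alpha * api for ri, api in zip(r, ap)]
        z = [m * ri for m, ri in zip(inv_diag, r)]
        rz_new = dot(r, z)
        p = [zi + (rz_new / rz) * pi for zi, pi in zip(z, p)]
        rz = rz_new
    return x


# Toy stand-ins (all hypothetical): M plays the role of the forward operator.
M = [[1.0, 0.5], [0.2, 1.0], [0.3, 0.3]]
NOISE_VAR = 0.01          # white noise, so C^-1 = 1 / NOISE_VAR
LAM = 1.0                 # regularization strength
S_TRUE = [2.0, 1.0]       # source used to generate noise-free mock data
S_KIN = [1.5, 1.5]        # kinematic prior source


def forward(s):           # M s
    return [dot(row, s) for row in M]


def adjoint(d):           # M^T d
    return [dot(col, d) for col in zip(*M)]


def apply_a(s):           # (M^T C^-1 M + lam I) s, without forming the matrix
    return [v / NOISE_VAR + LAM * si for v, si in zip(adjoint(forward(s)), s)]


data = forward(S_TRUE)
rhs = [v / NOISE_VAR + LAM * sk for v, sk in zip(adjoint(data), S_KIN)]
inv_diag = [1.0 / (sum(row[j] ** 2 for row in M) / NOISE_VAR + LAM)
            for j in range(len(S_TRUE))]
s_mp = pcg(apply_a, rhs, inv_diag)
print("most probable source:", [round(v, 3) for v in s_mp])
```

Because the data term dominates the regularization term in this toy setup,
the recovered source stays close to `S_TRUE` and is pulled only slightly
towards `S_KIN`.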

As is usually done when modelling galaxy–galaxy strong lensing
observations, the lens is described by a projected mass density profile
with a cored elliptical power-law distribution plus the contribution of an
external shear component of strength $\Gamma_{\rm sh}$ and position angle
$\theta_{\rm sh}$ (see Extended Data Table 1). This assumption has been
shown to provide a good fit to large samples of known lenses, as discussed,
for example, by Koopmans et al. and Barnabé et al. The dimensionless
projected mass density profile is defined as

$$\kappa(x, y) = \frac{\kappa_0\,(2 - \gamma/2)\,q^{\gamma - 3/2}}{2\,[q^{2}(x^{2} + r_{\rm c}^{2}) + y^{2}]^{(\gamma - 1)/2}}, \qquad (4)$$

where $\kappa_0$ is the normalization factor, $q$ is the projected
flattening, $\gamma$ is the density slope (a value of $\gamma = 2$
corresponds to an isothermal profile), $x_0$ and $y_0$ define the centre of
the mass distribution, $r_{\rm c}$ is the core radius and $\theta$ is the
position angle of the major axis. The lens mass model parameters are in
agreement within 2 s.d. with those derived from modelling the dust
continuum.
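
For concreteness, a cored elliptical power-law convergence can be evaluated
as in the short sketch below. It assumes the profile has the form
$\kappa = \kappa_0 (2 - \gamma/2)\,q^{\gamma - 3/2} / \{2\,[q^2(x^2 + r_{\rm c}^2) + y^2]^{(\gamma-1)/2}\}$
and that the coordinates have already been shifted to the lens centre and
rotated by the position angle; the function name and argument names are
illustrative only.

```python
def kappa(x: float, y: float, kappa0: float, q: float, gamma: float,
          r_core: float = 0.0) -> float:
    """Dimensionless projected mass density of a cored elliptical power law.

    (x, y) are lens-plane coordinates relative to the lens centre, aligned
    with the major axis of the mass distribution.
    """
    ellip_r2 = q ** 2 * (x ** 2 + r_core ** 2) + y ** 2
    norm = kappa0 * (2.0 - gamma / 2.0) * q ** (gamma - 1.5)
    return norm / (2.0 * ellip_r2 ** ((gamma - 1.0) / 2.0))


# Singular isothermal sphere limit: gamma = 2, q = 1, no core.
print(kappa(3.0, 4.0, kappa0=1.0, q=1.0, gamma=2.0))
```

In the isothermal, circular, core-free limit the expression reduces to
$\kappa = \kappa_0 / (2r)$, which gives a simple sanity check on the
implementation.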

The kinematic model $\mathbf{s}_{\rm kin}$ is defined by the parameters
describing the rotation velocity, the velocity dispersion profile and the
geometry of the galaxy. We choose the multi-parameter function

$$V_{\rm rot}(R) = V_{\rm t}\,\frac{(1 + R_{\rm t}/R)^{\beta}}{[1 + (R_{\rm t}/R)^{\xi}]^{1/\xi}} \qquad (5)$$

as the functional form for the rotation velocity (see Extended Data
Table 1). In equation (5), $V_{\rm t}$ is the velocity scale, $R_{\rm t}$ is
the turnover radius between the inner rising and outer part of the curve,
$\beta$ specifies the power-law behaviour of the curve at large radii and
$\xi$ defines the sharpness of the turnover. We choose the above
multi-parameter function because it is flexible enough to reproduce the
large variety of observed rotation curves, and therefore allows much more
freedom than other typically used functions, for example, the arctangent.

The velocity dispersion profile, $\sigma(R)$, is described by an exponential
function, which is more flexible than the more commonplace choice of
a constant value. We also tested a linear function but found it to be
significantly less favoured by the data, by a Bayes factor of 1.8 relative
to the exponential model.

For the geometry of the kinematic model, defined by the inclination
($i$) and position angle (PA), we assume that there is no radial variation
(Extended Data Table 1). The surface density of the gas is not a free
parameter; instead, we impose a pixel-by-pixel normalization, which is given
by the surface brightness distribution obtained from the lens modelling
of the zeroth-moment map. The advantage of using this normalization
is that it allows us to account for possible asymmetries in the gas
distribution. The derivation of the lens mass model and the source
kinematics is done using a four-step optimization scheme, and the
uncertainties on the parameters are obtained from the posterior
distributions calculated with MULTINEST (Extended Data Fig. 2), by adopting
a user-defined tolerance, sampling efficiency and number of live points of
0.5, 0.8 and 200, respectively. We then verified that the evidence estimated
by MULTINEST flattens as a function of the prior volume. As described in our
method paper, for each parameter, we adopt priors that are flat in the
intervals
$[\eta_{\rm lens/kin} - 0.2\,\eta_{\rm lens/kin},\ \eta_{\rm lens/kin} + 0.2\,\eta_{\rm lens/kin}]$,
where $\eta_{\rm lens/kin}$ are the best-fitting parameters, inferred from
the nonlinear optimization. To be conservative, we report as errors in the
parameters the sum in quadrature of the following two contributions: the
1 s.d. uncertainty on the posterior distributions derived by MULTINEST and
the difference between the maximum a posteriori parameter values obtained by
MULTINEST and by the nonlinear optimizer. These values of the uncertainties
are in line with what is expected from the tests presented in Rizzo et al.,
where we applied our methodology to data characterized by spatial and
spectral resolutions at least a factor of three worse than those analysed in
this manuscript.


Direct forward modelling 

As a consistency check, we also derived the lens
($\boldsymbol{\eta}_{\rm lens}$) and kinematic ($\boldsymbol{\eta}_{\rm kin}$)
parameters by using a direct forward modelling methodology. The main
difference from the methodology described in Rizzo et al. and summarized
above is that a parametric, rather than pixelated, source is used to fit the
data. The parametric source is defined by the same 3D kinematic model,
$\mathbf{s}_{\rm kin}$, used as a regularizing prior for the source in our
fiducial method. In this implementation, to marginalize over
$\boldsymbol{\eta}_{\rm lens}$ and $\boldsymbol{\eta}_{\rm kin}$, we maximize
the posterior probability distribution (see equation (3) for comparison),
defined as
$2\log P(\mathbf{d}\,|\,\boldsymbol{\eta}_{\rm lens}, \boldsymbol{\eta}_{\rm kin}) = -(\mathbf{D}\mathbf{L}\mathbf{s}_{\rm kin} - \mathbf{d})^{T}\mathbf{C}^{-1}(\mathbf{D}\mathbf{L}\mathbf{s}_{\rm kin} - \mathbf{d})$.

The parameters obtained with the forward modelling method are
within the 1 s.d. uncertainties with respect to those obtained with our
fiducial model (see Extended Data Fig. 2).

In the direct forward modelling method, the source is forced to have
a regular configuration, chosen a priori using the assumptions on the
kinematic parametrizations. As explained in detail in Rizzo et al.,
in our fiducial methodology we instead use the kinematic
model as a prior for the source reconstruction. The main advantage of
the fiducial approach is that it does not directly impose a parametric
model that may not be a good fit to the data.


Beam-smearing correction 

In recent years, observational studies have used the $V/\sigma$ ratio to
study the kinematic evolution of galaxy populations and distinguish between
rotation- and dispersion-dominated systems. However, the measurement
of the $V/\sigma$ ratio is a challenge for high-redshift, marginally
resolved studies because of the so-called beam-smearing effect. This effect
can cause a strong overestimation of the velocity dispersion and
underestimation of the rotation velocity. For galaxies in the redshift range
of 1-3, there is no consensus on whether there is an increase of the
$V/\sigma$ ratio with time. For galaxies at $z = 4$, there have been very
few attempts to go beyond the fitting of the integrated line profile.
However, marginally resolved observations and a low signal-to-noise ratio
have prevented a measurement of a beam-smearing-corrected $V/\sigma$ ratio.
The combination of the high signal-to-noise ratio and the spatial resolution
that is achievable thanks to gravitational lensing and to an intrinsically
beam-smearing-corrected kinematic analysis has allowed us to probe
the intrinsic $V/\sigma$ ratio at this high redshift. Different definitions
of the $V/\sigma$ ratio can be found in the literature. Here we use the two
most popular ones (see Extended Data Table 2). We obtain
$V/\sigma = 9.7 \pm 0.4$ when we calculate $V/\sigma$ as the ratio between
the maximum rotation velocity $V_{\rm max}$ and the mean velocity dispersion
$\sigma_{\rm m}$, whereas we get $V/\sigma = 13.7 \pm 0.7$ when we calculate
it as the ratio between the flat part of the rotation velocity
$V_{\rm flat}$ and the velocity dispersion $\sigma_{\rm ext}$ at outer radii
($R > 1$ kpc). The values of $V/\sigma$ for SPT0418-47 seem to indicate that
there is no decrease of $V/\sigma$ with time. However, larger samples will
be needed to confirm this trend.


$V/\sigma$ for the comparison samples

To determine the evolution of the dynamical properties of galaxies
across cosmic time, different studies have used different gas tracers
(see column 2 of Extended Data Table 3) and different extraction methods
to obtain their physical parameters. The comparison of $V/\sigma$ for
SPT0418-47 with values derived from different gas tracers is based on
the assumption that the ionized gas kinematics reflects the cold-gas
motions of galaxies, as observed for a few galaxies. However, by comparing
the results obtained from different tracers and with different
techniques, a recent study found that velocity dispersions measured
from ionized tracers tend to be ~10-15 km s$^{-1}$ higher than those
measured from molecular tracers. If we add this 15 km s$^{-1}$ to our
inferred [C II] velocity dispersions, we find
$V_{\rm max}/\sigma_{\rm m} = 6.6$ and $V_{\rm flat}/\sigma_{\rm ext} = 7.8$,
which are still much higher than those observed and those predicted by
galaxy evolution models. We also stress that whereas the above analysis
mostly compared the molecular with the ionized gas tracers, in this study we
focus on the [C II] emission, which traces both cold and warm gas. In
Extended Data Table 3, we show the main assumptions made to extract
the velocities and velocity dispersions that enter the calculation of the
$V/\sigma$ ratio for the samples shown in Fig. 3.
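
The effect of such a dispersion offset on $V/\sigma$ can be reproduced with
a few lines of Python. This is an illustrative sketch only: it assumes that
the mean velocity dispersion is the 32 km s$^{-1}$ value quoted elsewhere in
the Methods, and the function name is hypothetical.

```python
def v_over_sigma_with_offset(v_over_sigma: float, sigma_kms: float,
                             offset_kms: float) -> float:
    """Recompute V/sigma after adding offset_kms to the velocity dispersion."""
    v_kms = v_over_sigma * sigma_kms        # rotation velocity implied by the ratio
    return v_kms / (sigma_kms + offset_kms)


# Measured V_max/sigma_m = 9.7; sigma_m = 32 km/s is assumed here.
ratio = v_over_sigma_with_offset(9.7, 32.0, 15.0)
print(f"V_max/sigma_m with a +15 km/s offset: {ratio:.1f}")
```

With these inputs the ratio drops to about 6.6, matching the value quoted
for $V_{\rm max}/\sigma_{\rm m}$.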


$V/\sigma$ in galaxy evolution models

There is a general consensus that galaxy disks at high redshift are
much more turbulent than their local counterparts in all components
(stars, warm and cold gas). In Fig. 3 we show the $V/\sigma$ predicted (or
assumed) by current galaxy evolution models.

The light-blue area shows the predicted evolution of $V/\sigma$ in the TNG50
simulation, where the gas kinematics is derived from Hα-emitting gas.
However, because, by construction, the ionized and molecular gas of
TNG50 galaxies have the same dynamics, we can compare these estimations
with our measurement from the [C II] emission line. TNG50
galaxies at $z = 4$ have median $V_{\rm max}/\sigma_{\rm m} = 3$ with a
standard deviation of 1.5, which means that SPT0418-47 is 4.5 standard
deviations away from the median value.

By using an analytical approach and a cosmological mesh-refinement
simulation, two recent studies found that galaxies at $z = 3$ are dominated
by violent disk instabilities, which lead to $V/\sigma$ values of $< 2$ in
all components (grey area in Fig. 3).


By using an analytical model that combines stellar feedback and
gravitational instabilities, Krumholz et al. derived a prediction for the
correlation between cold-gas velocity dispersion and SFR as
${\rm SFR} = 0.42\,f_{g,Q}\,V^{2}\sigma/(\pi G Q_{\rm min})$ (equation (60)
in that work), where we have used the constants that are appropriate for
high-redshift galaxies. By using
$f_{g,Q} = f_{\rm gas} \times 1.5 = 0.53 \times 1.5$,
${\rm SFR} = 352\,M_\odot\,{\rm yr}^{-1}$, $V = 259$ km s$^{-1}$ and
$Q_{\rm min} = 0.88$, we derived a value of $\sigma = 183$ km s$^{-1}$, which
is a factor of ~6 higher than our measured value of 32 km s$^{-1}$ and
implies $V/\sigma = 1.4$ (black band in Fig. 3).
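
The velocity dispersion implied by this relation can be checked
numerically. The sketch below assumes the relation has the form
${\rm SFR} = 0.42\,f_{g,Q} V^2 \sigma / (\pi G Q_{\rm min})$ and inverts it for
$\sigma$; the unit-conversion constants and the function name are choices
made for this example rather than part of the original analysis.

```python
import math

G = 4.30091e-6                   # kpc (km/s)^2 / Msun
KM_PER_KPC = 3.0856775814913673e16
SEC_PER_YR = 3.15576e7
# 1 km/s/kpc expressed in 1/yr, to convert Msun (km/s)/kpc into Msun/yr.
PER_YR_PER_KMS_KPC = SEC_PER_YR / KM_PER_KPC


def predicted_sigma_kms(sfr_msun_yr: float, f_gq: float,
                        v_kms: float, q_min: float) -> float:
    """Invert SFR = 0.42 f_gQ V^2 sigma / (pi G Q_min) for sigma in km/s."""
    sfr_internal = sfr_msun_yr / PER_YR_PER_KMS_KPC    # Msun (km/s) / kpc
    return sfr_internal * math.pi * G * q_min / (0.42 * f_gq * v_kms ** 2)


sigma = predicted_sigma_kms(352.0, 0.53 * 1.5, 259.0, 0.88)
print(f"predicted sigma = {sigma:.0f} km/s, V/sigma = {259.0 / sigma:.1f}")
```

With the inputs quoted in the text this returns a dispersion of roughly
183 km s$^{-1}$ and $V/\sigma \approx 1.4$.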

Hayward and Hopkins used an analytical model to study the effects
of stellar feedback in regulating star formation and driving outflows. In
this case, the gas velocity dispersion is not a predicted quantity, but it
is derived by assuming the following relation between $\sigma$ and the
circular velocity: $\sigma \approx f_{\rm gas} V_{\rm c}/\sqrt{2}$. For the
values of the gas fraction and $V_{\rm c}$ measured for SPT0418-47 by our
analysis, this analytical model implies $\sigma = 120$ km s$^{-1}$, which is
a factor of ~4 higher than the measured value, resulting in
$V/\sigma = 2.6$. In Fig. 3 (green band), we show the predictions
of $V/\sigma$ for this model, obtained by using the relations of Hayward and
Hopkins for velocity versus stellar mass and the redshift evolution of the
velocity dispersion with respect to the gas fraction.


Dynamical model and Toomre parameter 

Under the assumption that the total galactic gravitational potential
$\Phi$ is axisymmetric, the rotation velocity $V_{\rm rot}(R)$ of the gas,
in cylindrical coordinates $(R, \phi, z)$, is related to $\Phi$ by the
equation
$R\,(\partial\Phi/\partial R)_{z=0} = V_{\rm c}^{2} = V_{\rm rot}^{2} + V_{\rm A}^{2}$,
where $V_{\rm c}$ is the circular velocity and $V_{\rm A}$ is the
asymmetric-drift correction that accounts for the pressure support due to
the random motions. Under the assumptions that the gas of the rotating disk
has a thickness that is independent of the radius, that it is thin and that
it has a spatial distribution described by an exponential profile,
$\Sigma_{\rm gas} = \Sigma_0 \exp(-R/R_{\rm gas})$, the expression for
$V_{\rm A}$ is given by

$$V_{\rm A}^{2} = -R\,\sigma^{2}\,\frac{\partial \ln(\sigma^{2}\Sigma_{\rm gas})}{\partial R} = -R\,\sigma^{2}\,\frac{\partial \ln[\sigma^{2}\exp(-R/R_{\rm gas})]}{\partial R}. \qquad (6)$$

To measure the scale radius $R_{\rm gas}$, we divide the zeroth-moment map
of the reconstructed source (Fig. 1d) into rings (with centres, position
angles and inclinations defined by the values of the kinematic
model; see Extended Data Table 1) and we calculate the surface densities
at a certain radius as azimuthal averages inside that ring. The
surface density profile obtained in this way is then fitted using the
exponential profile, resulting in a value of $R_{\rm gas} = 0.9$ kpc. The
resulting asymmetric-drift correction, equation (6), gives a small
contribution (<1%) with respect to $V_{\rm rot}(R)$.
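
The size of the asymmetric-drift correction can be illustrated with a short
calculation. The sketch assumes an exponential gas disk and an exponential
(or, by default, constant) velocity dispersion profile, for which the
logarithmic derivative is analytic; the dispersion scale length `r_sigma_kpc`
and the numerical inputs in the example are hypothetical values chosen only
to show the order of magnitude.

```python
import math


def asymmetric_drift_sq(radius_kpc: float, sigma_kms: float,
                        r_gas_kpc: float,
                        r_sigma_kpc: float = math.inf) -> float:
    """V_A^2 = -R sigma^2 d ln(sigma^2 Sigma_gas) / dR for exponential profiles.

    Sigma_gas is proportional to exp(-R / r_gas) and sigma to
    exp(-R / r_sigma); sigma_kms is the dispersion at radius_kpc. The default
    r_sigma = inf corresponds to a constant velocity dispersion.
    """
    dln_dr = -(2.0 / r_sigma_kpc + 1.0 / r_gas_kpc)
    return -radius_kpc * sigma_kms ** 2 * dln_dr


# Illustrative inputs: R = 2 kpc, sigma = 19 km/s, R_gas = 0.9 kpc.
va_sq = asymmetric_drift_sq(2.0, 19.0, 0.9)
v_rot = 259.0                                   # km/s, illustrative
v_circ = math.sqrt(v_rot ** 2 + va_sq)
fraction = (v_circ - v_rot) / v_rot
print(f"V_A = {math.sqrt(va_sq):.1f} km/s, correction = {100 * fraction:.2f}%")
```

For a dynamically cold disk of this kind the correction to the circular
velocity stays below the per cent level, consistent with the statement above.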

To derive the contribution of the gas, stellar and dark-matter components
to the total gravitational potential, we model the circular
velocity as
$V_{\rm c} = \sqrt{V_{\rm star}^{2} + V_{\rm gas}^{2} + V_{\rm DM}^{2}}$.
Here $V_{\rm star}$ is the stellar contribution to the circular velocity,
under the assumption that this component is described by a Sérsic profile,
defined by the total stellar mass $M_{\rm star}$, the effective radius
$R_{\rm e}$ and the Sérsic index $n$. $V_{\rm gas}$ is the gas contribution,
under the assumption that the gas in this galaxy has a distribution
described by an exponential profile, as traced by the [C II]
emission line. The scale length that enters $V_{\rm gas}$ is fixed at the
value $R_{\rm gas}$ found as described above, and the only free parameter
of the fit for $V_{\rm gas}$ is the conversion factor
($\alpha_{\rm [C\,II]}$) between the total [C II] luminosity and the total
gas mass. A number of recent studies have found that [C II] is a good tracer
of the total gas mass. For SPT0418-47, the [C II] luminosity is
$1.8 \times 10^{9}\,L_\odot$, obtained by computing the zeroth-moment map of
the [C II] emission as the signal integrated along the spectral axis at each
pixel of the reconstructed source. The dark-matter contribution
$V_{\rm DM}$ is modelled as a Navarro-Frenk-White (NFW) spherical halo, with
a concentration parameter of $c = 3.06$. The latter is obtained by averaging
the values of $c$ at $z = 4.22$ for dark-matter haloes with masses between
$10^{10}\,M_\odot$ and $10^{13}\,M_\odot$, assuming the mass-concentration
relation estimated by N-body cosmological simulations. We note that at this
redshift, $c$ is almost independent of the dark-matter halo mass, varying by
just 6% for a variation of 3 orders of magnitude in the halo mass. To test
the effect of our assumption of a constant concentration, we repeated the
analysis with a $c$ that is free to vary according to either a uniform prior
or a Gaussian prior centred on the predicted mass-concentration relation,
and found that the inferred dynamical parameters do not change considerably
with $c$.

We summarize our assumptions for the dynamical model in the second
column of Extended Data Table 4, and the corresponding free
parameters are shown in the third column of the same table. We compute
the Bayesian posterior distribution of these parameters using
DYNESTY, a Python implementation of the Dynamic Nested Sampling
algorithm (see Extended Data Fig. 3 and Extended Data Table 5). We
use log-uniform priors for the mass parameters and uniform priors for
the scale radii (fourth column of Extended Data Table 4). The inferred
$M_{\rm star} = 1.2 \times 10^{10}\,M_\odot$ is in excellent agreement with
the value of $(9.5 \pm 3.0) \times 10^{9}\,M_\odot$ found in a recent
independent study by fitting the spectral energy distribution of this
galaxy. For $\alpha_{\rm [C\,II]}$, we employ a uniform prior in the range
corresponding to ±3 standard deviations around the mean value of
$30\,M_\odot/L_\odot$, derived from a large sample of low- and
high-redshift galaxies. We infer a value of
$\alpha_{\rm [C\,II]} = 7.3\,M_\odot/L_\odot$, in agreement with other
studies of $z = 4$ dusty starburst galaxies.

In Extended Data Table 5 we show some relevant physical quantities 
(see also Fig. 4d) derived from our dynamical analysis. In particular, for 
each of these we quote the 16th, 50th and 84th percentile, which were 
obtained from the full posterior sample points returned by DYNESTY. 

We also calculate the Toomre parameter (Fig. 2d) using the general
definition $Q(R) = \sigma\kappa/(\pi G \Sigma_{\rm gas})$. Here, $\kappa$ is
the epicycle frequency defined as
$\kappa = \sqrt{R\,{\rm d}\Omega^{2}/{\rm d}R + 4\Omega^{2}}$, where
$\Omega = V_{\rm rot}/R$ is the angular frequency.
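
A minimal numerical sketch of this definition is given below. It evaluates
the epicycle frequency with a finite-difference derivative and then $Q(R)$
for user-supplied profiles. The flat rotation curve, the exponential gas disk
normalized to a total mass of $1.3 \times 10^{10}\,M_\odot$ and the
19 km s$^{-1}$ dispersion are illustrative assumptions, not the fitted
profiles of the analysis, and all function names are hypothetical.

```python
import math

G = 4.30091e-6  # gravitational constant in kpc (km/s)^2 / Msun


def epicyclic_frequency(v_rot, radius_kpc: float, dr: float = 1e-4) -> float:
    """kappa = sqrt(R dOmega^2/dR + 4 Omega^2), with Omega = V_rot / R.

    The radial derivative is taken with a central finite difference.
    """
    def omega_sq(r: float) -> float:
        return (v_rot(r) / r) ** 2

    d_omega_sq = (omega_sq(radius_kpc + dr) - omega_sq(radius_kpc - dr)) / (2 * dr)
    return math.sqrt(radius_kpc * d_omega_sq + 4.0 * omega_sq(radius_kpc))


def toomre_q(sigma_kms: float, v_rot, sigma_gas, radius_kpc: float) -> float:
    """Q(R) = sigma * kappa / (pi G Sigma_gas)."""
    kappa = epicyclic_frequency(v_rot, radius_kpc)
    return sigma_kms * kappa / (math.pi * G * sigma_gas(radius_kpc))


# Illustrative inputs (assumptions of this example).
V_FLAT, R_GAS, M_GAS = 259.0, 0.9, 1.3e10      # km/s, kpc, Msun


def flat_curve(r: float) -> float:
    return V_FLAT


def gas_disk(r: float) -> float:
    """Exponential disk surface density in Msun / kpc^2."""
    return M_GAS / (2.0 * math.pi * R_GAS ** 2) * math.exp(-r / R_GAS)


q_value = toomre_q(19.0, flat_curve, gas_disk, 2.0)
print(f"Q(2 kpc) = {q_value:.2f}")
```

For a flat rotation curve the epicycle frequency reduces to
$\kappa = \sqrt{2}\,V/R$, which provides a check on the finite-difference
step; with the inputs above, $Q$ comes out just below unity.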


Comparison with local ETGs 

In Fig. 4 we show some physical quantities for local ETGs taken from
several studies of the ATLAS$^{\rm 3D}$ survey. For comparison with
SPT0418-47, we show for each quantity an upper limit, calculated under
the assumption that all gas is converted into stars, and the observed
quantities (orange diamonds), which can also be interpreted as a lower
limit if all the gas that we observe is ejected by feedback processes.
However, we caution that this upper limit does not take into account
the accretion of gas if no baryons are expelled by outflows. In this case,
the red diamond could move farther towards the upper right corner in
the size–mass plane (panel a, Fig. 4).


Dust continuum and SFR 

Using the parameters of the lens mass model shown in Extended Data
Table 1, we perform a pixelated source reconstruction of the dust
continuum at 160 μm (rest frame), which allows us to calculate a total
magnification factor of $\mu = 32.3 \pm 2.5$. SPT0418-47 has an observed
(that is, not corrected for lensing) infrared luminosity of
$L_{\rm IR,obs} = (7.7 \pm 1.3) \times 10^{13}\,L_\odot$ (ref. %). This
value was obtained from a spectral energy distribution fitting of seven
photometric data points in the rest-frame wavelength range 48–574 μm.
Under the assumption that the morphology of the source is the same
for all photometric points in the infrared band, we can use the
magnification factor estimated from the emission at 160 μm to estimate an
intrinsic luminosity of $L_{\rm IR} = L_{\rm IR,obs}/\mu = (2.4 \pm 0.4) \times 10^{12}\,L_\odot$. By assuming
that the infrared emission comes from the thermal emission of dust,
heated by the radiation field coming from young stars, we compute
SFR = $(352 \pm 65)\,M_\odot\,{\rm yr}^{-1}$. This value is derived from $L_{\rm IR}$ by applying a
conversion factor of $1.48 \times 10^{-10}\,M_\odot\,L_\odot^{-1}\,{\rm yr}^{-1}$, which is valid for a Kroupa initial
mass function. The gas-depletion timescale is $M_{\rm gas}/{\rm SFR} = 38 \pm 9$ Myr.
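The chain of estimates above can be retraced with a few lines of arithmetic. This is only a sketch of the central values: the variable names are ours, the gas mass of $1.3 \times 10^{10}\,M_\odot$ is an assumed reading of the central value in Extended Data Table 5, and no uncertainties are propagated.

```python
MAGNIFICATION = 32.3      # total magnification factor mu
L_IR_OBSERVED = 7.7e13    # observed infrared luminosity, not de-lensed [Lsun]
KROUPA_FACTOR = 1.48e-10  # SFR per unit infrared luminosity [Msun / yr / Lsun]
M_GAS = 1.3e10            # assumed central gas mass [Msun]


def intrinsic_luminosity(l_observed, magnification):
    """Correct an observed luminosity for the lensing magnification."""
    return l_observed / magnification


def star_formation_rate(l_intrinsic, factor=KROUPA_FACTOR):
    """Convert an intrinsic infrared luminosity [Lsun] into an SFR [Msun/yr]."""
    return factor * l_intrinsic


def depletion_time_myr(m_gas, sfr):
    """Gas-depletion timescale M_gas / SFR, expressed in Myr."""
    return m_gas / sfr / 1.0e6


l_ir = intrinsic_luminosity(L_IR_OBSERVED, MAGNIFICATION)
sfr = star_formation_rate(l_ir)
t_dep = depletion_time_myr(M_GAS, sfr)
print(f"L_IR = {l_ir:.2e} Lsun, SFR = {sfr:.0f} Msun/yr, t_dep = {t_dep:.0f} Myr")
```

With the rounded gas mass assumed here, the depletion time comes out slightly below the quoted 38 Myr but well within its ±9 Myr uncertainty.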


Velocity dispersion in starburst galaxies 

By comparing the gas kinetic energy and the energy injected by stellar
feedback, we determine whether the turbulence driven by supernova
explosions can explain the velocity dispersion measured for SPT0418-47.
The gas kinetic energy is given by $E_{\rm kin} = \frac{3}{2} M_{\rm gas}\sigma_m^{2} \approx 5 \times 10^{56}$ erg
(see Extended Data Table 5). We estimate the energy injected by supernova
explosions as $E_{\rm SNe} = \eta_{\rm SN}\,{\rm SFR}\,\epsilon_{\rm SN}\,E_{\rm SN}\,t_d$. Here, $\eta_{\rm SN}$ is the supernova
rate, $\epsilon_{\rm SN}$ is the efficiency (that is, the fraction of the SN energy,
$E_{\rm SN} = 10^{51}$ erg, converted into turbulence) and $t_d$ is the dissipation time
of the turbulence. We assume $\eta_{\rm SN} = 0.01\,M_\odot^{-1}$ (ref. ®). For the highly
uncertain parameter $\epsilon_{\rm SN}$, we assume an upper limit of 0.1, based on
high-redshift observations. $t_d$ is calculated as $t_d = h/\sigma_m$, where $h$ is the
scale height of the gas, assumed to be ~500 pc. Under this assumption,
we estimate a value of $E_{\rm SNe}$ in agreement with $E_{\rm kin}$, suggesting that
the velocity dispersion measured for SPT0418-47 can be explained as
being produced by turbulence driven by stellar feedback. Furthermore,
we note that starburst galaxies at both low and high redshifts have
values of velocity dispersions, measured from molecular or neutral
gas, that are comparable with those found for SPT0418-47.


Data availability 


This study used the ALMA data 2016.1.01499.S, available at
http://almascience.eso.org/aq/.


Code availability 


The methodology used for the source reconstruction and its kinematic 
modelling is fully explained in ref. °. The code is available from the 
corresponding author upon reasonable request.

Extended Data Fig. 1 | Reconstruction of the [C II] emission and kinematic
model. The rows show representative channel maps at the velocity shown in
the upper left corner of column 4. Columns 1 and 2 show the 'dirty' image of the
data and the model, respectively, colour-coded by the flux in units of mJy per
beam. Column 3 shows the residuals (data minus model) normalized to the
noise. Columns 5 and 6 show the contours of the reconstructed source and of
the kinematic model used to constrain the source reconstruction. The contour
levels in the last columns are set at n = 0.1, 0.2, 0.4, 0.6, 0.8 times the value of the
maximum flux of the kinematic model.

Extended Data Fig. 2 | Corner plot showing the posterior distributions of
the lens and kinematic parameters. The dark and light areas in the
two-dimensional distributions show the 39% and 86% confidence levels,
corresponding to 1 s.d. and 2 s.d., respectively, obtained with the fiducial
methodology described in Rizzo et al. (ref. 10; green) and with the direct forward
modelling method (grey). From left to right, the first six panels show the lens
parameters, and the other panels show the source kinematic parameters (see
also Extended Data Table 1).

Extended Data Fig. 3 | Corner plot showing the posterior distributions of
the dynamical parameters. The posterior distributions are the results of the
decomposition of the circular velocity (Fig. 2c) into the physical components
contributing to the total gravitational potential: the stars, the gas disks and the
dark-matter halo. The fitted parameters are the stellar mass, $M_{\rm star}$, the effective
radius, $R_e$, the Sérsic index, $n$ (Sérsic profile), the mass of an NFW dark-matter
halo, $M_{\rm DM}$, and the conversion factor between the [C II] luminosity and the gas
mass, $\alpha_{\rm [C\,II]}$. The dashed lines in the one-dimensional histograms show the 16th,
50th and 84th percentiles (see Extended Data Table 5).




Extended Data Table 1 | Lens and source kinematic parameters

Lens parameters                    Kinematic parameters
κ_0 (arcsec)   1.22 ± 0.04         i (°)             54 ± 2
θ (°)          22.6 ± 1.6          PA (°)            −27 ± 3
q              0.91 ± 0.02         V_t (km s^-1)     245.1 ± 0.8
γ              2.05 ± 0.09         R_t (kpc)         0.14 ± 0.03
Γ_sh           0.0082 ± 0.0003     β                 0.80 ± 0.02
θ_sh (°)       29.9 ± 3.0          ξ                 2.0 ± 0.4
                                   σ_0 (km s^-1)     58.1 ± 1.8
                                   R_σ (kpc)         1.7 ± 0.1

The lens parameters describe a projected mass density profile with a cored elliptical power-law distribution (equation (1)) plus the contribution of an external shear component (Γ_sh, θ_sh). The
kinematic parameters describe a rotating disk with a rotation curve defined by a multi-parameter function (equation (2)), a velocity dispersion profile defined by an exponential function (σ_0, R_σ)
and a geometry defined by the inclination (i) and the position angle (PA).


Extended Data Table 2 | Kinematic properties for SPT0418-47
derived under different assumptions

Global kinematic properties
(a) V_max (km s^-1)    308 ± 4        (d) V_flat (km s^-1)    259 ± 1
(b) σ_m (km s^-1)      32 ± 1         (e) σ_ext (km s^-1)     18 ± 1
(c) V_max/σ_m          9.7 ± 0.4      (f) V_flat/σ_ext        13.5 ± 0.7

Parameters representing the rotation velocities and velocity dispersion profile, as well as the
rotation support for this galaxy, calculated under different definitions.
a. Maximum rotation velocity.
b. Mean velocity dispersion.
c. Rotation-to-random-motion ratio, calculated from V_max and σ_m.
d. Flat rotation velocity, calculated using the flat part of the rotation curve (R > 2 kpc; see Fig. 2c).
e. Velocity dispersion at outer radii (R > 1 kpc).
f. Rotation-to-random-motion ratio, calculated from V_flat and σ_ext.

Extended Data Table 3 | Kinematic measurements for the comparison samples

Study                      Tracer        V                                          σ
Lelli et al.               H I           Flat part of the rotation curve (V_flat)   10 km s^-1 (a)
Swinbank et al.            Hα, [O II]    Extracted at 3 R_disk                      Average (σ_m)
Harrison et al.            Hα            Extracted at 3.4 R_disk                    Median value at R > 3.4 R_disk
Di Teodoro et al.          Hα            Flat part of the rotation curve (V_flat)   Average (σ_m)
Wisnioski et al.           Hα            (V_pv,max − V_pv,min) (b)                  Average from outer regions (σ_ext)
Lelli et al.               [C I]         Flat part of the rotation curve (V_flat)   Average (σ_m)
Turner et al. (ref. 28)    [O III]       Extracted at 3.4 R_disk                    Median


See refs. "58°, 


a. We assume a value of σ = 10 km s^-1, typical of H I in local spiral galaxies.
b. V_pv,max and V_pv,min are the maximum and minimum velocities, respectively, along the position-velocity diagram.


Extended Data Table 4 | Assumptions for the dynamical fit 


Component      Density profile    Free parameter    Prior
Stars          Sérsic             M_star            [10’, 10'"] M⊙
                                  R_e               [0.04, 2.0] kpc
                                  n                 [0.5, 10]
Gas            Exponential        α_[C II]          [3.8, 238.0] M⊙/L⊙
Dark matter    NFW                M_DM              [10"°, 10'9] M⊙

Extended Data Table 5 | Physical quantities for SPT0418-47
derived from the kinematic and dynamical modelling


Parameters of the dynamical model 


Derived parameters 


Mgtar (10'° Mo) 1254 
Re (kpc) 0.22°5 02 

n 2o'e3 

Mom (10'? Mo) 1.7793 

aici (Mo/Lo) 7.3743 


®Moas (10'°Mo) 1.3293 
° Mar (10'°Mo) 2.5.94 
°Re, bar (kpc) [0.7579 08] 
“fom(< Re) 0.01879 :003 
[0.095°5: G07] 
°F, (107°) TAL 
[14.936] 
"Ties 0.53508 
7Voq9 (kms~') 32075 
"Roo (kpc) 70% 
'Vinax (km s~") 32378 


Left: parameters inferred from a dynamical fit to the circular velocity. The stellar component is
described by a Sérsic profile, the gas disk by an exponential profile and the dark matter by an
NFW profile. Right: the quantities in square brackets are calculated considering the gas
component, under the assumption that all the gas that we observe today will be converted
into stars, preserving the disk configuration.
a. Total gas mass, computed as $M_{\rm gas} = \alpha_{\rm [C\,II]} L_{\rm [C\,II]}$.
b. Total baryonic mass, computed as $M_{\rm bar} = M_{\rm star} + M_{\rm gas}$.
c. Baryonic half-mass radius.
d. Dark-matter fraction within the half-mass radius.
e. Stellar-to-halo-mass ratio, $M_{\rm star}/M_{\rm DM}$; the value in square brackets is $(M_{\rm star} + M_{\rm gas})/M_{\rm DM}$.
f. Gas fraction, $M_{\rm gas}/(M_{\rm star} + M_{\rm gas})$.
g. Virial velocity of the dark-matter halo.
h. Virial radius of the dark-matter halo.
i. Maximum velocity for an NFW halo, computed as $V_{\rm max} = V_{200}\sqrt{0.216\,c/[\ln(1+c) - c/(1+c)]}$.
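
The expression in the last footnote is straightforward to evaluate numerically. The sketch below is an illustration only: the function name is ours, and the concentration $c = 3$ is a hypothetical input chosen for the example, since the halo concentration is not quoted in this section.

```python
import math


def nfw_vmax(v200_kms, concentration):
    """Maximum circular velocity of an NFW halo.

    Implements V_max = V_200 * sqrt(0.216 c / [ln(1 + c) - c / (1 + c)]).
    """
    c = concentration
    f_c = math.log(1.0 + c) - c / (1.0 + c)
    return v200_kms * math.sqrt(0.216 * c / f_c)


# V_200 = 320 km/s is the central value listed in Extended Data Table 5;
# the concentration is an assumed, illustrative value.
v_max = nfw_vmax(320.0, concentration=3.0)
print(f"V_max = {v_max:.0f} km/s")
```

Because the ratio $V_{\rm max}/V_{200}$ depends only on $c$, the same function can be used to explore how sensitive $V_{\rm max}$ is to the assumed concentration.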

Quantum superpositions of macroscopically distinct classical states, so-called
Schrödinger cat states, are a resource for quantum metrology, quantum
communication and quantum computation. In particular, the superpositions of two
opposite-phase coherent states in an oscillator encode a qubit protected against
phase-flip errors. However, several challenges have to be overcome for this concept
to become a practical way to encode and manipulate error-protected quantum
information. The protection must be maintained by stabilizing these highly excited
states and, at the same time, the system has to be compatible with fast gates on the
encoded qubit and a quantum non-demolition readout of the encoded information.
Here we experimentally demonstrate a method for the generation and stabilization
of Schrödinger cat states based on the interplay between Kerr nonlinearity and
single-mode squeezing in a superconducting microwave resonator. We show an
increase in the transverse relaxation time of the stabilized, error-protected qubit of
more than one order of magnitude compared with the single-photon Fock-state
encoding. We perform all single-qubit gate operations on timescales more than sixty
times faster than the shortest coherence time and demonstrate single-shot readout
of the protected qubit under stabilization. Our results showcase the combination of
fast quantum control and robustness against errors, which is intrinsic to stabilized
macroscopic states, as well as the potential of these states as resources in quantum
information processing.


A quantum system that can be manipulated and measured tends to
interact with uncontrolled degrees of freedom in its environment,
leading to decoherence. This presents a challenge to the experimental
investigation of quantum effects and in particular to the field of
quantum computing, where quantum bits (qubits) must remain coherent
while operations are performed. Most noisy environments are only
locally correlated and thus cannot decohere quantum information
encoded in a non-local manner. Therefore, quantum information can
be protected through the use of spatial distance or entangled qubit
states. Crucially, this concept can be extended to non-local states
in the phase space of a single oscillator, with the additional benefit
of involving fewer physical components, a property termed hardware
efficiency. The latter is desirable because fully protecting a quantum
system against all forms of decoherence is likely to involve several layers
of encodings, and it is crucial to introduce efficient error protection
into the physical layer while maintaining simplicity.

A natural choice for non-locally encoding a qubit into the phase space
of an oscillator is superpositions of macroscopically distinct coherent
states, the so-called Schrödinger cat states. Here we choose the states
$|\mathcal{C}_\alpha^{\pm}\rangle = (|{+\alpha}\rangle \pm |{-\alpha}\rangle)/\sqrt{2}$ with average photon number $\bar{n} = |\alpha|^{2}$ and,
respectively, even and odd photon number parity as the Z eigenstates
of the encoded qubit (Fig. 1a). The coherent states $|{+\alpha}\rangle$ and $|{-\alpha}\rangle$ are
the approximate X eigenstates (Supplementary Information section I),
and their distance in phase space ensures protection against any noise
process that causes local displacements in this space. Crucially, this
leads to a suppression of phase flips that is exponential in the average
photon number $\bar{n}$ (refs. '”). In particular, photon loss, the usual noise
process in an oscillator, cannot induce transitions between $|{+\alpha}\rangle$ and
$|{-\alpha}\rangle$ because they are eigenstates of the annihilation operator $\hat{a}$. This
is not the case for their superpositions, so a stochastic photon-loss
event corresponds to a bit-flip error on the encoded qubit: $\hat{a}|\mathcal{C}_\alpha^{\pm}\rangle = \alpha|\mathcal{C}_\alpha^{\mp}\rangle$,
which also affects the parity-less Y eigenstates $|\mathcal{C}_\alpha^{\mp i}\rangle = (|{+\alpha}\rangle \mp i|{-\alpha}\rangle)/\sqrt{2}$,
where $i = \sqrt{-1}$. However, for a given single-photon-loss rate $\kappa_a$, the
bit-flip rate of approximately $2\bar{n}\kappa_a$ (ref. °, Supplementary Information
section XI) increases only linearly with the photon number. A qubit
with such 'biased noise' is an important resource in fault-tolerant
quantum computation. Additional layers of error correction can then
focus strongly on the remaining bit-flip error. This substantially
reduces the hardware complexity compared with conventional
approaches that use qubits without such error protection.

Here we show that such a protected qubit can be stabilized
autonomously in a simple and versatile implementation similar to a
superconducting transmon under parametric driving. Unlike in other
hardware-efficient encodings, the most nonlinear mode of our
system encodes and stabilizes the qubit without requiring auxiliary
nonlinear modes that could introduce additional uncorrectable errors.

Fig. 1 | Qubit encoding, stabilization and implementation. a, Bloch sphere of the
protected Kerr-cat qubit (KCQ) in the large-α limit (Supplementary Information
section I). The states on all six cardinal points are labelled, indicated by coloured
markers, and their Wigner function phase-space representations are sketched.
Here, $|{\pm Z}\rangle = |\mathcal{C}_\alpha^{\pm}\rangle = (|{+\alpha}\rangle \pm |{-\alpha}\rangle)/\sqrt{2}$ and $|{\pm Y}\rangle = |\mathcal{C}_\alpha^{\mp i}\rangle = (|{+\alpha}\rangle \mp i|{-\alpha}\rangle)/\sqrt{2}$. The
continuous X(θ) gate with the arbitrary rotation angle θ and the discrete Z(π/2)
gate are shown by a purple circle and a blue arrow, respectively. b, Bloch sphere of
the single-photon Fock qubit (FQ). c, Energy (E) dependence of equation (1) on
classical phase-space coordinates Re(α) and Im(α) for squeezing drive amplitudes
$\epsilon_2/2\pi = 17.75$ MHz (left) and $\epsilon_2 = 0$ (right) with a sketch showing the adiabatic ramp
of the drive over a time $\tau_{\rm ramp} \gg 1/2K$. Black lines are constant energy contours. The
quadrature expectation values of the $|{\pm X}\rangle$ states from a, b are indicated by their
respective markers. d, Photograph of the nonlinear resonator (purple frame)
inside the copper section of the readout cavity (Methods). Also represented are the
X(θ)-rotation drive ($\omega_a$) and the squeezing-generation drive ($2\omega_a$). e, Schematic of
the nonlinear resonator with pad offset δ to set the dispersive coupling to the
readout cavity (Methods) and spiral symbol representing the nonlinear inductor
(SNAIL element). f, Scanning electron micrograph of the SNAIL element consisting
of four Josephson junctions in a loop threaded by an external magnetic flux Φ.

For this protected qubit to be practical, it is essential that operations 
can be performed faster than the shortest decoherence timescale, 
here the bit-flip time. We experimentally demonstrate such fast gate 
operations and quantum non-demolition single-shot readout in 
a system that maintains phase-flip protection via the simultaneous 
stabilization of two opposite-phase coherent states. 

Our approach is based on the application of a resonant single-mode
squeezing drive to a Kerr-nonlinear resonator. In the frame rotating at
the resonator frequency $\omega_a$, the system is described by the Hamiltonian

$$\hat{H}_{\rm cat}/\hbar = -K\hat{a}^{\dagger 2}\hat{a}^{2} + \epsilon_2(\hat{a}^{\dagger 2} + \hat{a}^{2}), \qquad (1)$$


where $K$ is the Kerr nonlinearity, $\epsilon_2$ is the amplitude of the squeezing
drive and $\hbar$ is the reduced Planck's constant. Some intuition on this
system can be gained from computing $\langle\hat{H}_{\rm cat}\rangle$ as a function of classically
treated phase-space coordinates. There are two stable extrema at
$\pm\alpha = \pm\sqrt{\epsilon_2/K}$, as indicated by the markers in Fig. 1c. They correspond
to the lowest degenerate eigenstates of the quantum Hamiltonian
(Supplementary Information) and thus do not decay to vacuum.
These eigenstates are separated from the rest of the spectrum by an
energy gap $E_{\rm gap}/\hbar \approx 4K\bar{n}$ (Supplementary Information section III,
Supplementary Fig. 1), which sets the speed limit for operations and
readout. The energy barrier between these eigenstates prevents
jumps along the X-axis of this 'Kerr-cat qubit' (KCQ). If no squeezing
drive is applied, $\hat{H}_{\rm cat}$ reduces to the Hamiltonian of an anharmonic
oscillator. This resembles a superconducting transmon with
anharmonicity 2K, commonly used to encode a 'Fock qubit' (FQ) into the first
two photon number states $|0\rangle$ and $|1\rangle$ (Fig. 1b). Its classical energy
displays one single extremum in phase space harbouring the quadrature
expectation values of both X eigenstates without the protection of an
energy barrier (Fig. 1c, right panel). By toggling the squeezing drive, a
Kerr-nonlinear resonator can be tuned to implement either type of
qubit.
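
The classical picture described above can be checked numerically. The following is a minimal sketch, assuming the sign convention $\hat{H}_{\rm cat}/\hbar = -K\hat{a}^{\dagger 2}\hat{a}^{2} + \epsilon_2(\hat{a}^{\dagger 2} + \hat{a}^{2})$ and replacing $\hat{a}$ by a complex number $\alpha$; the function names are ours, and the drive values are the ones quoted in the text and in the Fig. 1c caption.

```python
import math

K_MHZ = 6.7       # Kerr nonlinearity K/2pi quoted in the text [MHz]
EPS2_MHZ = 17.75  # squeezing amplitude eps_2/2pi from the Fig. 1c caption [MHz]


def classical_energy(alpha, kerr=K_MHZ, eps2=EPS2_MHZ):
    """Classical <H_cat>/h in MHz: -K |alpha|^4 + eps_2 (alpha*^2 + alpha^2).

    The minus sign in front of K is an assumed convention.
    """
    alpha = complex(alpha)
    return -kerr * abs(alpha) ** 4 + 2.0 * eps2 * (alpha * alpha).real


def extremum_amplitude(kerr=K_MHZ, eps2=EPS2_MHZ):
    """Amplitude of the two non-trivial extrema, alpha = sqrt(eps_2 / K)."""
    return math.sqrt(eps2 / kerr)


def slope_along_real_axis(x, step=1e-6):
    """Central-difference derivative of the energy along Re(alpha)."""
    return (classical_energy(x + step) - classical_energy(x - step)) / (2.0 * step)


alpha_0 = extremum_amplitude()
n_bar = alpha_0 ** 2            # average photon number |alpha|^2
gap_mhz = 4.0 * K_MHZ * n_bar   # energy gap estimate 4 K n_bar, in MHz
print(f"alpha = {alpha_0:.3f}, n_bar = {n_bar:.2f}, gap = {gap_mhz:.1f} MHz")
print(f"slope at +alpha: {slope_along_real_axis(alpha_0):.2e}")
```

The slope vanishes at $\pm\alpha$, and the resulting photon number is close to the value $\bar{n} = 2.6$ used later in the experiments.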

Our experimental implementation consists of a superconducting
nonlinear resonator placed inside a three-dimensional (3D) microwave
cavity (Fig. 1d). This is a standard setup in 3D transmon qubits
with a few key modifications (Methods). The foremost modification,
employing a superconducting nonlinear asymmetric inductive element
(SNAIL) as a nonlinear inductor (Fig. 1f), allows us to create
single-mode squeezing by applying a coherent microwave drive $\omega_s$
at twice the resonator frequency $\omega_a$ and makes the resonator flux
tunable. Here we tune our device to a frequency $\omega_a/2\pi = 6$ GHz and
Kerr nonlinearity $K/2\pi = 6.7$ MHz. At this frequency, the FQ has an
amplitude damping time $T_1 = 15.5$ μs and a transverse relaxation time
$T_2 = 3.4$ μs.

The FQ is employed for initialization and measurement of the KCQ
during most experiments described in this work. This is possible
because the states $|0\rangle$, $|\mathcal{C}_\alpha^{+}\rangle$ ($|1\rangle$, $|\mathcal{C}_\alpha^{-}\rangle$) spanning the two Bloch spheres
have the same even (odd) photon number parity, which is conserved
by the system Hamiltonian (equation (1)). Consequently, ramping the
squeezing drive on and off slowly with respect to 1/2K, as sketched in
Fig. 1c, adiabatically maps between the FQ and KCQ (Methods).

We now show that we indeed implement the Hamiltonian (1), and
thus initialize and stabilize a KCQ, by demonstrating the unique features
of Rabi oscillations around the X axis of its Bloch sphere. To this end,
we apply an additional coherent drive $\epsilon_x\hat{a}^{\dagger} + \epsilon_x^{*}\hat{a}$ with amplitude $\epsilon_x$
and frequency $\omega_x = \omega_s/2$ to the system. This lifts the degeneracy between
the states $|{+\alpha}\rangle$ and $|{-\alpha}\rangle$, and therefore leads to oscillations with a Rabi
frequency

$$\Omega_x = {\rm Re}(4\epsilon_x\alpha) \qquad (2)$$

between their superposition states along the purple circle in Fig. 1a.
This picture is valid for large enough $\alpha$ and for $\epsilon_x \ll E_{\rm gap}/\hbar$ (ref. ‘; see
also Supplementary Information section IV). Note that equation (2) is
different from the Rabi frequency of a FQ in two ways. First, it depends
on the amplitude of the squeezing drive through $\alpha \propto \sqrt{\epsilon_2}$. Second, it
varies with the phase of the applied Rabi drive $\arg(\epsilon_x)$ (for simplicity
we chose $\epsilon_2, \alpha \in \mathbb{R}$).

We first focus on the effect of the squeezing drive on the Rabi
frequency. We initialize in $|\mathcal{C}_\alpha^{+}\rangle$ and apply a Rabi drive with constant $|\epsilon_x|$ and
$\arg(\epsilon_x) = 0$ for a variable time Δt and a variable amplitude $\epsilon_2$ (Fig. 2a).
Figure 2b shows the Rabi frequencies for each $\epsilon_2$, extracted from the
oscillations in the measured FQ $|0\rangle$-state population fraction at the end
of the experiment. For $\epsilon_2 = 0$, we are simply driving FQ Rabi oscillations
giving a direct calibration of $|\epsilon_x|/2\pi = 740$ kHz. For large values of $\epsilon_2$,
the Rabi frequency becomes a linear function of $\sqrt{\epsilon_2}$, confirming the
theoretical prediction of equation (2). The solid black line shows a
one-parameter fit to a numerical simulation of our experiment
(Methods).
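
To make the two dependences in equation (2) concrete, the short sketch below evaluates $\Omega_x = {\rm Re}(4\epsilon_x\alpha)$ with $\alpha = \sqrt{\epsilon_2/K}$ taken real. It is an illustration under the large-$\alpha$ approximation only, not the numerical simulation described in Methods; the function name is ours, and all frequencies are expressed as values divided by $2\pi$ in MHz.

```python
import cmath
import math


def rabi_frequency_mhz(eps_x_mhz, drive_phase, eps2_mhz, kerr_mhz):
    """Omega_x/2pi = Re(4 eps_x alpha), with alpha = sqrt(eps_2 / K) real.

    eps_x_mhz is |eps_x|/2pi and drive_phase is arg(eps_x) in radians.
    Valid only in the large-alpha, weak-drive limit.
    """
    alpha = math.sqrt(eps2_mhz / kerr_mhz)
    eps_x = cmath.rect(eps_x_mhz, drive_phase)
    return (4.0 * eps_x * alpha).real


# Values quoted in the text: |eps_x|/2pi = 740 kHz, K/2pi = 6.7 MHz,
# and eps_2/2pi = 15.5 MHz for the phase-dependence measurement.
for phase in (0.0, math.pi / 4, math.pi / 2):
    omega = rabi_frequency_mhz(0.74, phase, 15.5, 6.7)
    print(f"arg(eps_x) = {phase:.3f} rad -> Omega_x/2pi = {omega:.3f} MHz")
```

Shifting the drive phase by π only flips the sign of $\Omega_x$, and a phase of π/2 suppresses the oscillations altogether.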

We now turn to another unique feature of these Rabi oscillations by
setting $\epsilon_2/2\pi = 15.5$ MHz and varying Δt and $\arg(\epsilon_x)$. As expected, the
measured oscillations shown in Fig. 2c are π-periodic in $\arg(\epsilon_x)$. Three
cuts through this data (dashed lines) are shown in Fig. 2d. The top panel
corresponds to a phase difference of π/2 between the coherent state
amplitude and the Rabi drive, meaning that oscillations are suppressed.
The middle and bottom panels at respective phase differences of π/4
and 0 display increasing Rabi frequencies, with the latter corresponding
to the red star marker in Fig. 2b. In the bottom panel, the black line is the
result of a numerical simulation scaled to match the contrast of the data.
The black lines in the top and middle panels use the same scaling factor
and are thus parameter-free predictions in good agreement with the
measured data. Having benchmarked our simulation in this way, we use
it to compute the full density matrix of the resonator, which we represent
with Wigner functions (Fig. 2e). Apart from slight asymmetries due to
the finite ramp time of the initial mapping pulse, they agree well with
the expected $|{+Z}\rangle$, $|{+Y}\rangle$, $|{-Z}\rangle$ and $|{-Y}\rangle$ states of the KCQ.

Next, we characterize the mapping operation and a set of single-qubit
gates on the KCQ by performing process tomography (Methods). In
all subsequent experiments, the average photon number of the cat
states is set to $\bar{n} = 2.6$ and frequency shifts induced by the squeezing
drive are taken into account by setting $\omega_s/2 = \tilde{\omega}_a$, where $\tilde{\omega}_a$ is the
Stark-shifted resonator frequency (Supplementary Information section
II). The pulse sequence for tomography of the mapping between the
FQ and the KCQ is shown in Fig. 3a and the measured state vectors are
plotted on a Bloch sphere in Fig. 3b. An estimate of the fidelity
$\mathcal{F}_{\rm map} = 0.855 \pm 0.002$ (± one standard deviation here and for all
subsequent values) is obtained by using the Pauli transfer matrix approach
(Supplementary Information). This number reflects the fidelity of the
tomography FQ pulses as well as of the mapping itself, because, apart
from a normalization by the FQ Rabi contrast, the presented fidelities
include state-preparation-and-measurement (SPAM) errors
(Supplementary Information section V). We expect these errors to be
dominated by decoherence during the comparatively slow adiabatic ramps.
This could be remedied in future experiments by using optimal pulse
shapes, which can reduce the duration of the mapping operation by a
factor of more than 40 with respect to its present value.

We now turn to the pulse sequence shown in Fig. 3c, which additionally
performs an X(π/2) gate on the KCQ. The process tomography data
(Fig. 3d) shows the desired rotation around the X-axis with a fidelity of
$\mathcal{F}_{X(\pi/2)} = 0.857 \pm 0.001$. Comparing this value to $\mathcal{F}_{\rm map}$ indicates that
$\mathcal{F}_{X(\pi/2)}$ is mostly limited by SPAM errors. From a complementary
measurement (Supplementary Fig. 3), we estimate the infidelity
due to over-rotation and decoherence during the gate operation to
be about 0.01.

Fig. 2 | Rabi oscillations of the protected KCQ. a, Pulse sequence to perform
the following functions: (1) initialize the KCQ ($|0\rangle \to |\mathcal{C}_\alpha^{+}\rangle$), (2) drive Rabi
oscillations for a varying time Δt, and (3) map onto the FQ and perform
dispersive readout. $\omega_s$, $\omega_s/2$ and $\omega_b$ are the frequencies of the respective drives.
A black arrow indicates the endpoint of the numerical simulations performed
for b, d, e (Methods). b, Dependence of the Rabi frequency $\Omega_x$ on $\sqrt{\epsilon_2}$.
Experimental data are open grey circles. The solid black line is a one-parameter
fit to the data used to calibrate $\epsilon_2$ (Methods). The red star indicates the
condition $\epsilon_2/2\pi = 15.5$ MHz used for c, d, e. c, Dependence of the experimentally
measured Rabi oscillations on time Δt and on the phase of the Rabi drive $\arg(\epsilon_x)$.
The colour scale gives the ground state population of the FQ ($P_{|0\rangle}$) at the end of
the experiment. d, Cuts of c for the three Rabi-drive phases indicated by dashed
lines. Open grey circles are the experimental data and black lines are the
simulation. Symbols in the bottom panel indicate the times for which the
simulated oscillator state is shown in e. e, Simulated phase-space
representation of the oscillator density matrix corresponding to the $|{+Z}\rangle$,
$|{+Y}\rangle$, $|{-Z}\rangle$ and $|{-Y}\rangle$ states of the KCQ (from top to bottom). The colour scale
gives the value of the Wigner function W(α) as a function of the real and
imaginary part of the phase-space coordinate α.

Fig. 3 | KCQ gate process tomography. a, c, e, Pulse sequences for process
tomography of mapping between FQ and KCQ, which is ideally the identity
operation $\mathbb{1}$ (a), mapping and the X(π/2) gate (c), as well as mapping and the
Z(π/2) gate (e). In each sequence, the FQ is initialized on the ±X, ±Y and ±Z
cardinal points of the Bloch sphere, the respective operation is performed,
and the expectation values ⟨X⟩, ⟨Y⟩ and ⟨Z⟩ are measured by a combination
of FQ pulses and dispersive readout (grey box). In c, $T_{X(\pi/2)} = 24$ ns is the total
duration of the Gaussian Rabi pulse applied to the KCQ. In e, $T_{Z(\pi/2)} = 38$ ns is the
duration for which the squeezing drive is switched off to perform the gate.
b, d, f, Process tomography results presented on a Bloch sphere for the
operations $\mathbb{1}$ (b), X(π/2) (d) and Z(π/2) (f). The expectation-value vector after
the operation for an initialization on the (+X, −X, +Y, −Y, +Z, −Z) cardinal point is
represented by an (orange square, green diamond, gold right-facing triangle,
purple left-facing triangle, red upward-facing triangle, blue downward-facing
triangle). Error bars are smaller than the size of the markers. The fidelities are
0.855 ± 0.002, 0.857 ± 0.001 and 0.811 ± 0.001, respectively.

Fig. 4 | Cat-quadrature readout (CR) and coherence times. a, Top and middle:
histogram of the cavity output field when performing CR (see b) after preparation
of either $|{+\alpha}\rangle$ (top) or $|{-\alpha}\rangle$ (middle). Bottom: corresponding probability
distribution along the I quadrature. Open orange (green) circles show measured
data for $|{+\alpha}\rangle$ ($|{-\alpha}\rangle$) and solid lines are Gaussian fits of width σ used to scale the
quadrature axes, I and Q. Setting a threshold at I/σ = 0 (dashed line) implements a
direct single-shot readout of the KCQ along its X-axis. b, CR pulse sequence for the
measurements presented in a, c. After state initialization in $|{\pm\alpha}\rangle$ (Y(±π/2) gate on
the FQ and mapping), a pulse at frequency $\omega_{\rm cr} = \omega_b - \omega_s/2$ is applied for a time
$T_{\rm cr} = 3.6$ μs converting the quadrature amplitude of the KCQ to a drive on the
readout cavity at $\omega_b$. The wait time Δt is set to zero to obtain the results shown in a

As this operation is compatible with an arbitrary angle of rotation,
only a π/2 rotation around the Z-axis is needed to reach any point on
the KCQ Bloch sphere. Nominally, such a gate is incompatible with the
stabilization as it could be used to go between the states $|{+\alpha}\rangle$ and $|{-\alpha}\rangle$.
However, for $\epsilon_2 = 0$, the free evolution of the Kerr Hamiltonian for a
time π/2K = 37.3 ns achieves the required operation (Methods and
Fig. 3e). The tomography data shown in Fig. 3f gives a fidelity
$\mathcal{F}_{Z(\pi/2)} = 0.811 \pm 0.001$. We attribute the reduction of fidelity with
respect to $\mathcal{F}_{\rm map}$ to the difference between the actual gate time
$T_{Z(\pi/2)} = 38$ ns and π/2K, and to the finite rise time of the step function
in $\epsilon_2$ of about 4 ns; both are limitations not of our device but
of our room-temperature electronics (Supplementary Information
section VI).
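
The quoted gate time follows directly from the Kerr nonlinearity. As a quick check, assuming only the value $K/2\pi = 6.7$ MHz given in the text (the function name is ours):

```python
import math


def kerr_z_gate_time_ns(kerr_over_2pi_mhz):
    """Free-evolution time pi / (2K) for a Z(pi/2) gate, in nanoseconds."""
    kerr_rad_per_ns = 2.0 * math.pi * kerr_over_2pi_mhz * 1.0e-3  # MHz -> rad/ns
    return math.pi / (2.0 * kerr_rad_per_ns)


ideal_ns = kerr_z_gate_time_ns(6.7)
actual_ns = 38.0  # gate time used in the experiment
print(f"pi/2K = {ideal_ns:.1f} ns, mismatch = {actual_ns - ideal_ns:.1f} ns")
```

The mismatch of under 1 ns between the ideal and the applied gate time is the first of the two error sources named above.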

So far, we have characterized the basic properties and gate operations
of the KCQ by mapping back onto the FQ and using the
well-understood dispersive readout method. This readout, however,
destroys the state of the KCQ. We now demonstrate an entirely new
way to perform a quantum non-demolition measurement on the X
component of the stabilized KCQ, which we call the 'cat-quadrature
readout' (CR). We apply an additional drive at frequency $\omega_{\rm cr} = \omega_b - \omega_s/2$,
where $\omega_b$ is the frequency of the readout cavity. Through the three-wave
mixing capability of our system, this generates a frequency-converting
interaction between the nonlinear resonator and the readout cavity.
In the frame rotating at both $\omega_s/2 = \tilde{\omega}_a$ and $\omega_b$, this adds the following
term to equation (1):

$$\hat{H}_{\rm cr}/\hbar = i g_{\rm cr}(\hat{a}\hat{b}^{\dagger} - \hat{a}^{\dagger}\hat{b}). \qquad (3)$$

Here, $\hat{b}$ is the annihilation operator of the cavity field and $g_{\rm cr}/2\pi = 1.7$ MHz
is the independently measured coupling strength (Supplementary
Fig. 6). For a quadrature expectation value $\langle\hat{a} + \hat{a}^{\dagger}\rangle/2 = \pm\alpha$ in the
nonlinear resonator, this causes an effective coherent drive on the
cavity and projects the KCQ onto the corresponding state $|{\pm\alpha}\rangle$ along
its X-axis. We gain information about the result of this projection by
measuring the emitted cavity field.

and varied in c. c, KCQ ⟨X⟩-component coherence. Open blue circles are data and
solid black lines are single-exponential fits with decay times $\tau_{+X} = 105$ μs ± 1 μs and
$\tau_{-X} = 106$ μs ± 1 μs. d, Pulse sequence for e. After initialization in $|\mathcal{C}_\alpha^{\mp i}\rangle$ (mapping
$|0\rangle \to |\mathcal{C}_\alpha^{+}\rangle$ and X(±π/2) gate) and variable wait time Δt, a Z(π/2) gate is performed
followed by CR. e, KCQ ⟨Y⟩-component coherence. Open blue circles are data and
solid black lines are single-exponential fits with decay times $\tau_{+Y} = 2.51$ μs ± 0.06 μs
and $\tau_{-Y} = 2.60$ μs ± 0.05 μs. f, Pulse sequence for g. After initialization in $|\mathcal{C}_\alpha^{\pm}\rangle$
(mapping $|0\rangle \to |\mathcal{C}_\alpha^{+}\rangle$ and either X(0) or X(π) gate) and variable wait time Δt, an X(π/2)
gate and a Z(π/2) gate are performed followed by CR. g, KCQ ⟨Z⟩-component
coherence. Open blue circles are data and solid black lines are single-exponential
fits with decay times $\tau_{+Z} = 2.60$ μs ± 0.07 μs and $\tau_{-Z} = 2.56$ μs ± 0.07 μs.


We characterize the fidelity of this readout by first initializing the
KCQ along its X-axis and then applying a CR pulse for a time $T_{\rm cr} = 3.6$ μs
as shown in Fig. 4b. Two histograms of the measured cavity field are
shown in Fig. 4a for initialization in $|{+\alpha}\rangle$ and $|{-\alpha}\rangle$, respectively. Their
separation is large enough to implement a single-shot readout by
setting a threshold at I/σ = 0 with total fidelity $\mathcal{F} = 0.74$ (Methods), where
I/σ is a dimensionless quantity corresponding to the I-quadrature
signal divided by the standard deviation σ of the histograms. This
fidelity is a lower bound including errors in state preparation caused
by the thermal population of the FQ $|1\rangle$ state (contributing an infidelity
of about 2 × 4% = 8%) and imperfections during the initial FQ pulse and
mapping. Finally, we characterize the quantum-non-demolition aspect
of the CR as $\mathcal{Q} = 0.85$ from two successive measurements (Methods).

We nowuse this CR to investigate the phase-flip time of the KCQ. The 
decay of the <X) component for either initial state along this axis is 
measured using the pulse sequence shown in Fig. 4b. We fit the data to 
asingle-exponential decay with characteristic times T,,=105 ps +1 ps 
and t_y=106 ps +1 ps. Additional measurements with dispersive read- 
out confirm this result (Supplementary Fig. 7). 

Similarly, the coherence times of both the ⟨Y⟩ and ⟨Z⟩ components
are measured using CR, but employing only operations on the KCQ
after the initial |0⟩ → |C_α^+⟩ mapping operation (Fig. 4d, f). The resulting
decay curves are displayed in Fig. 4e, g. Single-exponential fits of the
data yield the decay times τ_{+Y} = 2.51 μs ± 0.06 μs, τ_{−Y} = 2.60 μs ± 0.05 μs,
τ_{+Z} = 2.60 μs ± 0.07 μs and τ_{−Z} = 2.56 μs ± 0.07 μs. These values are slightly
smaller than the predicted bit-flip time due to photon loss,
τ_loss = T_1/(2n̄) = 2.98 μs. We expect that photon-gain processes play a role
in this reduction (Supplementary Information section XI, Supplementary
Fig. 8).
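
The photon-loss bound quoted above can be checked with a few lines of arithmetic. The sketch below assumes the relation τ_loss = T_1/(2n̄) and the operating point n̄ = 2.6 mentioned later in the text. The function names are illustrative, and the T_1 value it produces is inferred from the quoted numbers rather than taken from the paper.

```python
def loss_limited_bit_flip_time(t1_us, n_bar):
    """Bit-flip time limited by single-photon loss: tau_loss = T1 / (2 * n_bar)."""
    return t1_us / (2.0 * n_bar)


def implied_t1(tau_loss_us, n_bar):
    """Invert tau_loss = T1 / (2 * n_bar) to get the T1 implied by a quoted tau_loss."""
    return 2.0 * n_bar * tau_loss_us


N_BAR = 2.6          # mean photon number at the operating point (from the text)
TAU_LOSS_US = 2.98   # predicted loss-limited bit-flip time in microseconds (from the text)
MEASURED_US = {"+Y": 2.51, "-Y": 2.60, "+Z": 2.60, "-Z": 2.56}  # fitted decay times

t1_us = implied_t1(TAU_LOSS_US, N_BAR)
# Fraction by which each measured time falls short of the loss-limited prediction.
shortfall = {axis: 1.0 - tau / TAU_LOSS_US for axis, tau in MEASURED_US.items()}

if __name__ == "__main__":
    print(f"implied T1 = {t1_us:.2f} us")
    for axis, frac in shortfall.items():
        print(f"tau_{axis} is {100 * frac:.0f}% below tau_loss")
```

All four measured times fall short of the loss-limited value by a similar margin, which is consistent with an additional mechanism such as photon gain acting on both components.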

Our results demonstrate a 30-fold increase in the phase-flip time
of the protected KCQ with respect to the FQ. Crucially, we perform a
full set of single-qubit gates on the protected qubit on timescales that
are much shorter than its bit-flip time. Although the measured gate
fidelities are limited by SPAM errors, an upper bound of the error
rate due to decoherence during the gate operations is given by
T_{X(π/2)}/τ_{+Y} < T_{Z(π/2)}/τ_{+Y} < 2 × 10⁻². The combination of error protection,
fast gates and single-shot readout opens the door to using stabilized
Schrödinger cat states as physical qubits in a future quantum computer.
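
The bound on the decoherence-induced gate error follows directly from numbers given elsewhere in the paper. The following sketch uses the gate durations stated in Methods (24 ns for X(π/2), 38 ns for Z(π/2)) and the fitted time τ_{+Y} = 2.51 μs; the variable and function names are illustrative.

```python
GATE_DURATION_NS = {"X(pi/2)": 24.0, "Z(pi/2)": 38.0}  # from Methods
TAU_PLUS_Y_NS = 2510.0                                  # tau_{+Y} = 2.51 us, from the fits


def decoherence_error_bound(gate_ns, coherence_ns):
    """Upper bound on the error accumulated during a gate: T_gate / tau."""
    return gate_ns / coherence_ns


bounds = {
    gate: decoherence_error_bound(duration, TAU_PLUS_Y_NS)
    for gate, duration in GATE_DURATION_NS.items()
}

if __name__ == "__main__":
    for gate, bound in bounds.items():
        print(f"{gate}: T_gate / tau_+Y = {bound:.4f}")
```

Both ratios come out below 2 × 10⁻², with the Z(π/2) gate setting the larger of the two because it is the slower operation.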


The simplicity of our implementation provides a straightforward path 
to coupling several KCQs and demonstrating operations between them. 
In particular, our qubit permits a noise-bias-preserving controlled-NOT 
gate®, which would be impossible with standard two-level qubits’. More- 
over, KCQs could be applied as auxiliary systems for fault-tolerant error 
detection on other logical qubits>. This will require further improvements
in device performance such as the bit-flip time, currently limited
by losses due to the copper cavity. 

The limitation of the phase-flip time also requires further investiga- 
tion. Measurements performed at other flux points with different 
strengths of the third- and fourth-order nonlinearities indicate that 
coherence decreases when stronger drives have to be applied to the 
system to reach similar photon numbers. Similarly, an increase in photon
number beyond the n̄ = 2.6 operating point chosen in this work
decreases coherence. This is probably related to heating effects asso- 
ciated with the strong driving of Josephson junction devices” caus- 
ing leakage to higher excited states outside of the KCQ encoding. Such 
leakage can be counteracted through controlled two-photon dissipa- 
tion back towards the states of the KCQ®. This dissipation-based 
approach to stabilization has been effective in achieving strongly 
biased noise”’, but quantum operations that are much faster than all 
coherence timescales of the encoded qubit remain difficult to achieve”. 
The optimal solution should be to rely on a combination of two-photon
dissipation, squeezing drive and Kerr nonlinearity for phase-flip 
suppression and on the Kerr nonlinearity for high gate speeds. 

In addition to its applications in fault-tolerant quantum compu- 
tation* °°, our system extends the understanding of bistability in 
parametrically driven Kerr nonlinear oscillators from the classical 
regime*”’ * to the quantum regime where nonlinearity dominates over 
losses***>3*5, Our results demonstrate long-lived quantum superpo- 
sitions of degenerate, macroscopically distinct semi-classical states 
arising from Hamiltonian bistability. Such states could shed light on the 
quantum-classical transition® and can be useful in weak force measure- 
ments”. Networks of coupled bistable oscillators can be mapped onto 
Ising spins and used to investigate non-equilibrium quantum phase 
transitions®® or to solve combinatorial optimization problems” “. 
These examples suggest that our Kerr-cat system is likely to be applied
both to quantum computation and to the investigation of fundamental
quantum effects.
Methods


Experimental implementation 

As described in the main text, our experimental implementation con- 
sists of a superconducting nonlinear resonator placed inside a 3D 
microwave cavity (Fig. 1d, Supplementary Information section VI). The 
main differences with respect to a 3D transmon qubit are as follows. 
Large capacitor pads help reduce the Kerr nonlinearity of the resonator 
relative to a transmon and thus limit the drive strength ε_2 necessary to
reach appreciable coherent-state amplitudes. We orient the resonator
such that its dipole moment is perpendicular to the electric field direction
of the lowest-frequency cavity mode (ω_b/2π = 8.9 GHz) to avoid strong
hybridization despite the large pads, and reintroduce a small precisely
tuned coupling necessary for dispersive readout by slightly offsetting the
pads (note δ in Fig. 1e). Furthermore, instead of a single Josephson junction,
we employ a SNAIL” (Fig. 1f). This makes the resonator flux tunable
and endows it with both third- and fourth-order nonlinearities. We use
the former to generate single-mode squeezing by applying a coherent
microwave drive ω_s at twice the resonator frequency, thus converting
one drive photon into two resonator photons through three-wave mixing;
the latter yields the required Kerr nonlinearity. An in-depth description
of the device design and a full list of system parameters are given in Sup- 
plementary Information sections VI, VII, Supplementary Table 2. 


Adiabatic mapping between the FQ and KCQ Bloch spheres 
Without the large-α approximation from the main text, the expressions
for the even and odd cat states are given by |C_α^±⟩ = N_α^±(|+α⟩ ± |−α⟩),
where α = √(ε_2/K) is the amplitude of the coherent states in the
superposition and N_α^± = 1/√(2(1 ± e^{−2|α|²})) is a normalization coefficient
accounting for |⟨+α|−α⟩| = e^{−2|α|²} ≠ 0 (ref.*, Supplementary Information).
These expressions are valid for all ε_2/K and, in the limit ε_2/K → 0, they
become |C_α^+⟩ → |n = 0⟩ and |C_α^−⟩ → |n = 1⟩. This validates the mapping
between the FQ and KCQ Bloch spheres, which share a common definition
|±Z⟩ = |C_α^±⟩ for their respective values of α. We perform the mapping
by ramping the squeezing drive on and off with a hyperbolic-tangent 
profile over 320 ns. To avoid leakage to higher excited states, this time 
is chosen to be much longer than 1/2K, where 2K corresponds to the 
anharmonicity of the FQ. 
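
The normalization and the small-amplitude limit of the cat states can be checked numerically in a truncated Fock basis. This is a minimal sketch that assumes a real amplitude α; the truncation at 60 photons, the sample value α = 1.6 and all function names are illustrative choices, not parameters of the experiment.

```python
import math


def coherent_amplitudes(alpha, n_max):
    """Fock-basis amplitudes <n|alpha> of a coherent state with real alpha."""
    amps = []
    c = math.exp(-alpha ** 2 / 2.0)  # <0|alpha>
    for n in range(n_max + 1):
        amps.append(c)
        c *= alpha / math.sqrt(n + 1)  # <n+1|alpha> = <n|alpha> * alpha / sqrt(n+1)
    return amps


def cat_state(alpha, sign, n_max=60):
    """Amplitudes of N * (|+alpha> + sign * |-alpha>); sign = +1 (even) or -1 (odd)."""
    norm = 1.0 / math.sqrt(2.0 * (1.0 + sign * math.exp(-2.0 * alpha ** 2)))
    plus = coherent_amplitudes(alpha, n_max)
    minus = coherent_amplitudes(-alpha, n_max)
    return [norm * (p + sign * m) for p, m in zip(plus, minus)]


def inner(u, v):
    """Inner product of two real amplitude vectors."""
    return sum(a * b for a, b in zip(u, v))


ALPHA = 1.6  # illustrative amplitude
overlap = inner(coherent_amplitudes(ALPHA, 60), coherent_amplitudes(-ALPHA, 60))
even, odd = cat_state(ALPHA, +1), cat_state(ALPHA, -1)
small_even, small_odd = cat_state(0.05, +1), cat_state(0.05, -1)

if __name__ == "__main__":
    print(f"<+a|-a> = {overlap:.6f}, exp(-2 a^2) = {math.exp(-2 * ALPHA ** 2):.6f}")
    print(f"norms: even {inner(even, even):.12f}, odd {inner(odd, odd):.12f}")
    print(f"small alpha: |<0|C+>| = {small_even[0]:.6f}, |<1|C->| = {small_odd[1]:.6f}")
```

For small α the even state is almost entirely |n = 0⟩ and the odd state almost entirely |n = 1⟩, which is the limit the adiabatic mapping relies on.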


Numerical simulation 

Here we describe the numerical simulation performed to obtain the 
results shown in Fig. 2. For the data presented in Figs. 3, 4 we compen- 
sate for all drive-related detunings by calibrating the drive frequen- 
cies (Supplementary Information section VIII, Supplementary Fig. 5). 
However, when measuring the data shown in Fig. 2, it would be impractical
to perform this calibration for each drive strength ε_2. Instead we
apply the tone generating the squeezing drive at twice the unshifted
mode frequency, ω_s = 2ω_a, and the X-rotation drive at frequency ω_s/2. The
resulting system Hamiltonian is


$$\hat{H}/\hbar = -K\hat{a}^{\dagger 2}\hat{a}^{2} + 3g_3\xi\,(\hat{a}^{\dagger 2} + \hat{a}^{2}) - 4K|\xi|^{2}\hat{a}^{\dagger}\hat{a} + \epsilon_x\hat{a}^{\dagger} + \epsilon_x^{*}\hat{a},$$

where we have expressed ε_2 in terms of the independently calibrated
third-order nonlinearity g_3 ≈ 20 MHz and the dimensionless drive
strength ξ (Supplementary Information section II). The third term
corresponds to a drive-dependent frequency detuning due to the a.c.
Stark shift.
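
To see why the first two terms of this Hamiltonian stabilize coherent states, it helps to factorize them. The short derivation below is a sketch under two assumptions that are not spelled out in this section: the squeezing-drive amplitude ε_2 is taken to be real, and the Stark-shift and X-rotation terms are dropped.

```latex
\begin{aligned}
-K\hat{a}^{\dagger 2}\hat{a}^{2} + \epsilon_2\left(\hat{a}^{\dagger 2} + \hat{a}^{2}\right)
  &= -K\left(\hat{a}^{\dagger 2} - \frac{\epsilon_2}{K}\right)
        \left(\hat{a}^{2} - \frac{\epsilon_2}{K}\right) + \frac{\epsilon_2^{2}}{K}, \\
\hat{a}^{2}\,|{\pm\alpha}\rangle &= \alpha^{2}\,|{\pm\alpha}\rangle
  \quad\text{with}\quad \alpha = \sqrt{\epsilon_2/K}
  \;\;\Longrightarrow\;\;
  \left(\hat{a}^{2} - \frac{\epsilon_2}{K}\right)|{\pm\alpha}\rangle = 0 .
\end{aligned}
```

Because the right-hand factor annihilates both |+α⟩ and |−α⟩, these two coherent states, and therefore the cat states built from them, are degenerate eigenstates with eigenvalue ε_2²/K. This is consistent with the amplitude α = √(ε_2/K) used for the adiabatic mapping.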

We perform a numerical simulation of our system, which is described
by the master equation

$$\dot{\hat{\rho}} = -\frac{i}{\hbar}[\hat{H}, \hat{\rho}] + \kappa_a(1 + n_{\mathrm{th}})\,\mathcal{D}[\hat{a}]\hat{\rho} + \kappa_a n_{\mathrm{th}}\,\mathcal{D}[\hat{a}^{\dagger}]\hat{\rho},$$

where $\mathcal{D}[\hat{O}]\hat{\rho} = \hat{O}\hat{\rho}\hat{O}^{\dagger} - \tfrac{1}{2}\hat{O}^{\dagger}\hat{O}\hat{\rho} - \tfrac{1}{2}\hat{\rho}\hat{O}^{\dagger}\hat{O}$, ρ̂ is the simulated density
matrix of the nonlinear resonator, the dot indicates the time
derivative, κ_a = 1/T_1 is the single-photon-loss rate of the nonlinear
resonator and n_th = 0.04 its equilibrium thermal |1⟩-state occupation
number. Our simulation mimics the experiment using the same rise
times for the tanh ramps of the squeezing drive (320 ns) and the
X-rotation drive (80 ns) up to the point just before mapping back
onto the FQ marked by a black arrow in Fig. 2a. At this point, we compute
⟨C_α^+|ρ̂|C_α^+⟩.

We calibrate the abscissa of Fig. 2b by fitting this simulation to the
Rabi frequency obtained at the maximum strength of the squeezing
drive using the corresponding value of ξ (or equivalently ε_2) as the only
free parameter. We then perform the simulation for all other values of
ε_2 to obtain the solid black line shown in the figure. The black line in
the bottom panel of Fig. 2d corresponds to the result of one of these
simulations (marked by a red star in Fig. 2b) scaled to match the contrast
of the data. This accounts for the fact that in the experiment the state
|C_α^+⟩ is mapped onto the |0⟩ state of the FQ, which is then measured. This
process as well as the finite mapping and measurement fidelities are
not part of the simulation. The black lines in the other two panels of
this figure are obtained by changing arg(ε_x) but using the same scaling
factor.


Experimental details of the gate process tomography 

Here we describe pulse sequences used to obtain the data presented 
in Fig. 3a, c, e. At the beginning of each sequence, the FQ is initialized 
in one of the six states |±X⟩, |±Y⟩ and |±Z⟩. These are then adiabatically
mapped onto the KCQ. Each sequence ends with the reverse
mapping and measurement of the resulting ⟨X⟩, ⟨Y⟩ and ⟨Z⟩ component
of the FQ state using appropriate FQ pulses (grey box in the
figures).

To perform the X(π/2) gate, we apply a Gaussian pulse with a duration
of 24 ns and a maximum amplitude of ε_x = 6.5 MHz.

To implement a Z(π/2) gate, we abruptly set the squeezing
drive amplitude to zero for a duration T_{Z(π/2)} = 38 ns and then switch
it on again, as shown in Fig. 3e. Our room-temperature electronics
limit the rise and fall times of ε_2 to about 4 ns with a maximum time
resolution of 2 ns.


Experimental details of the CR 

During the CR, we perform heterodyne detection of the emitted cavity 
field. The data in Fig. 4a are a histogram of 3 x 10° measurements of this 
demodulated and integrated field. 

We conservatively define the total readout fidelity as
ℱ = 1 − p(−α|+α) − p(+α|−α) = 0.74, where p(−α|+α) = 0.13 (p(+α|−α) = 0.13)
is the probability of measuring the qubit in |−α⟩ (|+α⟩) after initialization
in |+α⟩ (|−α⟩).
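
The fidelity definition above is simple enough to reproduce directly. The sketch below recomputes the total fidelity from the two quoted misassignment probabilities and separates out the thermal state-preparation contribution of about 2 × 4% mentioned in the main text; the function name and the notion of a "residual" budget are illustrative.

```python
def total_readout_fidelity(p_minus_given_plus, p_plus_given_minus):
    """Conservative fidelity: 1 minus both misassignment probabilities."""
    return 1.0 - p_minus_given_plus - p_plus_given_minus


P_MINUS_GIVEN_PLUS = 0.13   # p(-alpha | +alpha), from the text
P_PLUS_GIVEN_MINUS = 0.13   # p(+alpha | -alpha), from the text
THERMAL_INFIDELITY = 2 * 0.04  # state-preparation error from thermal |1> population

fidelity = total_readout_fidelity(P_MINUS_GIVEN_PLUS, P_PLUS_GIVEN_MINUS)
infidelity = 1.0 - fidelity
# Whatever is left after removing the thermal part: pulse and mapping
# imperfections plus the intrinsic readout error.
residual = infidelity - THERMAL_INFIDELITY

if __name__ == "__main__":
    print(f"fidelity = {fidelity:.2f}, infidelity = {infidelity:.2f}")
    print(f"thermal part = {THERMAL_INFIDELITY:.2f}, residual = {residual:.2f}")
```

Roughly a third of the total infidelity is attributable to thermal population, which is why the quoted fidelity is described as a lower bound on the readout itself.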

We characterize the quantum-non-demolition aspect by evaluating
𝒬 = (P(+α|+α) + P(−α|−α))/2 = 0.85, where P(i|i) is the probability
of obtaining the measurement outcome i in two successive measurements
(Supplementary Information section IX, Supplementary
Fig. 6).


Supplementary information content 

The accompanying Supplementary Information contains further 
details on: the general encoding (section I), the full system Hamiltonian
(section II), direct spectroscopy of the energy gap (section III,
Supplementary Fig. 1), the continuous X rotation (section IV), the
characterization of the gate operations (section V, Supplementary
Figs. 2, 3, Supplementary Table 1), the experimental setup (section
VI, including a detailed wiring diagram in Supplementary Fig. 4),
the system parameters (section VII, including a summary Supplementary
Table 2), the experimental tune-up sequence (section VIII,
Supplementary Fig. 5), the CR (section IX, Supplementary Fig. 6),
the measurement of the KCQ coherence times obtained by mapping
back onto the FQ and performing dispersive readout (section X,
Supplementary Fig. 7) and a discussion of the impact of different
decoherence processes on the KCQ (section XI, Supplementary
Fig. 8).


Data availability 


The data that support the findings of this study are available from the 
corresponding authors upon reasonable request. 


Code availability 


Numerical simulations were performed using a Python-based open 
source software (QuTiP). Custom Python code was used to obtain 
and analyse the experimental data following standard practices as 
outlined in Methods and the Supplementary Information. The code 
used in this study is available from the corresponding authors upon 
reasonable request. 

Of the two stable forms of graphite, hexagonal and rhombohedral, the former is more
common and has been studied extensively. The latter is less stable, which has so far
precluded its detailed investigation, despite many theoretical predictions about the
abundance of exotic interaction-induced physics¹⁻⁶. Advances in van der Waals
heterostructure technology’ have now allowed us to make high-quality rhombohedral
graphite films up to 50 graphene layers thick and study their transport properties.
Here we show that the bulk electronic states in such rhombohedral graphite are 
gapped’ and, at low temperatures, electron transport is dominated by surface states. 
Because of their proposed topological nature, the surface states are of sufficiently 
high quality to observe the quantum Hall effect, whereby rhombohedral graphite 
exhibits phase transitions between a gapless semimetallic phase and a gapped 
quantum spin Hall phase with giant Berry curvature. We find that an energy gap can 
also be opened in the surface states by breaking their inversion symmetry by applying
a perpendicular electric field. Moreover, in rhombohedral graphite thinner than
four nanometres, a gap is present even without an external electric field. This
spontaneous gap opening shows pronounced hysteresis and other signatures 
characteristic of electronic phase separation, which we attribute to emergence of 
strongly correlated electronic surface states. 


Films of rhombohedral graphite (RG; also described as chirally stacked
ABC graphene multilayers) are predicted to possess nearly flat bands
localized at the surfaces; these bands are particularly suitable for
exploring electron-electron interactions’. In RG, interlayer hopping
(parameterized by γ_1; see Fig. 1a) dimerizes the electronic states
of the opposite sublattices of all contiguous graphene layers in the
bulk, shifting their energies away from the Fermi level®!°. Electrons of
only two sublattices, one in the top layer, and the other in the bottom
layer, remain at low energies. These surface states are considered to be
equivalent to topological edge states in the ‘polyacetylene’
Su-Schrieffer-Heeger model". The surface states residing on the inequivalent
sublattices of the opposite surfaces of N-layer-thick samples are expected
to form nearly flat bands with approximate energy dispersion relations
of the form E ~ ±p^N (refs. **"), where p is the in-plane momentum. These
surface states are further theorized to have strong electron-electron
interactions, hypothetically leading to spontaneous quantum Hall
states, ferromagnetism and superconductivity! ©”. When considering
other hopping parameters (γ_2–γ_4 are shown in Fig. 1a; Methods), the
surface flat bands acquire a finite bandwidth of the order of 2γ_4γ_1/γ_0
as well as electron-hole asymmetry, and trigonal warping due to γ_2
and γ_3 (Fig. 1b), which could affect electron-electron interactions,
spontaneous symmetry breaking and the opening of a bandgap (Fig. 1c).
Owing to the metastable nature of RG, the reported transport experiments
have been limited to ABC trilayer and ABCA tetralayer graphene,
either in suspended two-terminal devices, or with the spectrum modified
by moiré superlattices; longer ABC sequences have been studied
using magneto-Raman’’ and ARPES”. Here we report electronic
transport in high-quality RG of up to N = 50 graphene layers thick,
where we observe strong electronic correlations and phase separation,
phenomena typically reserved for unconventional superconductors
and heavy fermion materials. Eight RG devices were studied in detail
in this work, with thicknesses ranging from about 2.3 nm (N = 7) to
about 16.5 nm (N = 50).
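
The quoted thicknesses can be related to the layer counts with a one-line conversion. The sketch below assumes the textbook graphite interlayer spacing of about 0.335 nm, which is not stated in this section, so the results should be read as approximate; the function name is illustrative.

```python
INTERLAYER_SPACING_NM = 0.335  # assumed textbook value for graphite, not from the text


def film_thickness_nm(n_layers, spacing_nm=INTERLAYER_SPACING_NM):
    """Approximate film thickness as (number of layers) x (interlayer spacing)."""
    return n_layers * spacing_nm


# Layer counts quoted in the text, with the thicknesses reported alongside them.
QUOTED_NM = {7: 2.3, 9: 3.0, 50: 16.5}
estimated_nm = {n: film_thickness_nm(n) for n in QUOTED_NM}

if __name__ == "__main__":
    for n, quoted in QUOTED_NM.items():
        print(f"N = {n:2d}: estimated {estimated_nm[n]:.2f} nm, quoted about {quoted} nm")
```

The estimates agree with the quoted values to within a few per cent, which is the level of agreement one would expect for thicknesses reported as approximate.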

We identified the stacking order of graphite films using Raman spectroscopy
and, to retain their high electronic quality, we encapsulated
the exfoliated films with hexagonal boron nitride (hBN) crystals using
the directional dry transfer technique’ (Methods). Figure 1d compares
low-temperature (T = 20 mK) resistivity ρ_xx of RG and hexagonal graphite
(HG) devices of the same thickness (device 1, about 3 nm thick, 9
graphene layers) as a function of total carrier density n = (C_t V_t + C_b V_b)/e.
(Here V_t and V_b are the gate voltages, and C_t and C_b are the unit-area
capacitances, of the top and bottom gates, respectively, and e is the

Fig. 1 | Transport characteristics of thin RG films. a, Schematic of RG hopping
parameters, γ_i. Light- and dark-grey spheres represent carbon atoms of
sublattices A and B, respectively. b, Dispersion of the surface states including
all the hopping parameters in a. The vertical axis is energy, E, and horizontal
axes are in-plane momenta, p. Skew hopping γ_4 adds an appreciable bandwidth
2γ_4γ_1/γ_0 to the otherwise flat surface state bands, and introduces electron-hole
asymmetry. The presence of γ_2 and γ_3 leads to trigonal warping. Blue and red
colours indicate conduction and valence bands, respectively, and the
double-headed arrow shows the finite band overlap between conduction and
valence bands. c, Scenario when a bandgap Δ > 2γ_4γ_1/γ_0 spontaneously opens in
the RG spectrum. The double-headed arrow indicates the size of the bandgap.
d, ρ_xx as a function of total carrier density n = n_t + n_b measured at D = 0 for device
1 with rhombohedral (black curve) and hexagonal (blue curve) crystal
structures (both are nine graphene layers thick). n_t (n_b) is the charge density
induced by top (back) gate voltage. Left inset, optical micrograph of device 1;
scale bar, 2 μm. e, ρ_xx as a function of the displacement field D; n = 0, T = 20 mK.
Inset graphs in d and e show the corresponding σ_xx curves on a logarithmic scale.


elementary charge.) The RG device shows much stronger modulation
of ρ_xx(n) as compared to the reference HG device, such that ρ_xx of the
former spans 10 Ω to 23 kΩ and exhibits a profound high-resistivity
region near n = 0. At the same time, the HG device displays metallic
behaviour over the entire range of carrier densities (ρ_xx varies only from
8 Ω to 63 Ω), similarly to the reported dual-gated devices of tetralayer
and hexalayer Bernal-stacked graphene”.

Figure 1e shows three distinct high-resistance regions for RG device
1 (which are absent in the HG device) as a function of displacement field
D = (C_t V_t − C_b V_b)/(2ε_0) measured at n = 0 (ε_0 is the vacuum permittivity). The
appearance of the high-resistivity region at the neutrality point (n = 0,
D = 0) suggests that an energy gap opens in the surface states spectrum,
whereas the ρ_xx value close to the resistance quantum h/e² hints at
edge-state transport caused possibly by topological character of the
insulating state. The resistivity peak at the neutrality point was observed
only in sufficiently thin (<4 nm) RG devices and cannot be explained
within a free-particle model (Methods). Before discussing this
high-resistivity feature, we consider states induced by a large displacement
field. These regions of high resistivity, observed in all RG devices,
emerging above a critical displacement field, |D| > D_c (see Fig. 2a–d),
are in agreement with the theoretical prediction that a high enough D
opens an energy gap in the surface state spectrum of RG”. The effect
is similar to that observed in bilayer graphene under displacement
but has much stronger nonlinear screening of charges at the two
RG surfaces (Methods). Note that the observation of these
displacement-induced gaps requires a gapped bulk of multilayer RG

Fig. 2 | Thickness dependence of the transport characteristics of multilayer
RG. a-d, Conductivity maps σ_xx(n, D) for RG devices 1 to 4, respectively, of
different thicknesses as specified in the panels. e, Thickness dependence of the
critical field D_c, above which the transport gap is opened (black diamonds);
thickness is expressed as number of layers, N_layer. D_c values were extracted from
linear fits of σ_xx(D) at n = 0 near the insulating states in the maps shown in a-d.
Error bars represent uncertainties of the fitting procedure. The red curve is the
self-consistent model (Methods). f, Arrhenius plot of σ_xx for RG device 1 at
D = 0.57 V nm⁻¹ (red squares), 0.62 V nm⁻¹ (blue squares), 0.66 V nm⁻¹ (green
squares) and 0.76 V nm⁻¹ (yellow squares). The grey pentagons are the data for
D = 0. Linear fits of high-T regions (solid lines) give the bandgap Δ presented in
the inset. The red curve in the inset is the bandgap estimation using the
self-consistent screening model (Methods). For comparison, the thermal
activation energy of the high-resistance state at D = 0 V nm⁻¹ is also plotted in
the inset (blue star).


(Methods). The resistivity of our fully gapped RG film (both surface
and bulk states are gapped) is of the order of 20 kΩ for the thin RG
sample at 20 mK (device 1), and 2-3 kΩ for the 16.5-nm-thick sample
(device 4) at 1.5 K.
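
The dual-gate relations used above, n = (C_t V_t + C_b V_b)/e and D = (C_t V_t − C_b V_b)/(2ε_0), show how two gates allow doping and displacement field to be tuned independently. The following is a minimal sketch; the dielectric constant (3.5) and thickness (30 nm) of the gate dielectric are hypothetical values chosen only for illustration, as are the function names.

```python
E_CHARGE = 1.602176634e-19   # elementary charge, C
EPS0 = 8.8541878128e-12      # vacuum permittivity, F/m


def gate_capacitance(eps_r, thickness_m):
    """Parallel-plate capacitance per unit area, F/m^2."""
    return EPS0 * eps_r / thickness_m


def carrier_density(c_t, v_t, c_b, v_b):
    """Total carrier density n = (C_t V_t + C_b V_b) / e, in m^-2."""
    return (c_t * v_t + c_b * v_b) / E_CHARGE


def displacement_field(c_t, v_t, c_b, v_b):
    """Displacement field D = (C_t V_t - C_b V_b) / (2 eps0), in V/m."""
    return (c_t * v_t - c_b * v_b) / (2.0 * EPS0)


# Hypothetical, identical top and bottom dielectrics (not values from the paper).
C_GATE = gate_capacitance(eps_r=3.5, thickness_m=30e-9)

# Antisymmetric gating: zero doping, finite displacement field.
n_antisym = carrier_density(C_GATE, 5.0, C_GATE, -5.0)
d_antisym_v_per_nm = displacement_field(C_GATE, 5.0, C_GATE, -5.0) * 1e-9

# Symmetric gating: finite doping, zero displacement field.
n_sym_cm2 = carrier_density(C_GATE, 1.0, C_GATE, 1.0) * 1e-4
d_sym = displacement_field(C_GATE, 1.0, C_GATE, 1.0)

if __name__ == "__main__":
    print(f"antisymmetric: n = {n_antisym:.1f} m^-2, D = {d_antisym_v_per_nm:.3f} V/nm")
    print(f"symmetric:     n = {n_sym_cm2:.2e} cm^-2, D = {d_sym:.1f} V/m")
```

With these assumed dielectric parameters, a few volts of antisymmetric gate bias gives a displacement field of a few tenths of a volt per nanometre, which is the range discussed for the gap opening.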

The effect of RG thickness on the displacement-field-induced bandgap
is shown in Fig. 2a–d, which shows σ_xx(n, D) maps for four RG devices
with thicknesses ranging from 3 nm to 16.5 nm (devices 1 to 4). Figure 2e
summarizes how the critical displacement field D_c increases with sample
thickness. The solid line is the self-consistent theory (see Methods).
From the T dependence of σ_xx of 3-nm-thick RG device 1, we extracted
the sizes of thermally activated gaps Δ at different D (Fig. 2f), which are
plotted in the inset of Fig. 2f. The bandgap observed in RG at |D| > D_c is in stark
contrast to the HG response to the displacement field where D causes
the resistivity to decrease (Fig. 1e) due to the growing size of the Fermi
surfaces of the top and bottom surface states’. The behaviour of
σ_xx(D) presented in Fig. 2a–d is characteristic of the uniform ABCABC...
stacking in our RG samples. We note that, if mixed ABC/ABA stacking
were present in our devices, this should have prevented gap opening
by the displacement field (Methods), which provides complementary
evidence for complete ABC stacking in our RG devices (Extended Data
Figs. 1 and 2).
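
Extracting a thermally activated gap from an Arrhenius plot amounts to a straight-line fit of ln σ against 1/T. The sketch below assumes the common convention σ ∝ exp(−Δ/(2k_BT)); this section does not state which convention the authors used, and the data are synthetic, generated from a hypothetical 10 meV gap purely to show that the fit recovers it.

```python
import math

K_B_MEV_PER_K = 0.08617333262  # Boltzmann constant in meV/K


def fit_activation_gap(temps_k, sigmas):
    """Least-squares slope of ln(sigma) vs 1/T, converted to a gap in meV.

    Assumes sigma ~ exp(-gap / (2 k_B T)), so slope = -gap / (2 k_B).
    """
    xs = [1.0 / t for t in temps_k]
    ys = [math.log(s) for s in sigmas]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sum(
        (x - mean_x) ** 2 for x in xs
    )
    return -2.0 * K_B_MEV_PER_K * slope


# Synthetic high-temperature data for a hypothetical 10 meV gap.
TRUE_GAP_MEV = 10.0
temps = [20.0, 25.0, 30.0, 40.0, 50.0]
sigmas = [1e-3 * math.exp(-TRUE_GAP_MEV / (2.0 * K_B_MEV_PER_K * t)) for t in temps]

fitted_gap_mev = fit_activation_gap(temps, sigmas)

if __name__ == "__main__":
    print(f"fitted gap = {fitted_gap_mev:.3f} meV")
```

In practice only the high-temperature part of the curve should be fitted, as the figure caption indicates, because other conduction channels flatten the curve at low temperature.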

Next we explore electron transport in multilayer RG using magnetic
field B applied perpendicular to the plane. Longitudinal (ρ_xx)

Fig. 3 | Quantum Hall effect in RG. a, Map of longitudinal conductivity σ_xx(n, B)
for device 1; D = 0; T = 20 mK. Arrows and numbers indicate the filling factors of
quantum Hall states. b, σ_xx(ν, B) for the hole side (same data as in a). c,
Differential dσ_xx/dν(ν, B) map for the electron side (data from a). Red arrows
mark the positions of LL crossings. d, Calculated spectra of LLs in multilayer
RG, using gapped and gapless band structures at low (coloured curves) and
high (black) energies, respectively. The free-particle and gapped regions are
separated by phase transitions. Red and blue lines, depending on the order
parameter, correspond to electronic states with opposite spins (quantum spin
Hall) or of opposite valleys (layer antiferromagnetism). E_F, Fermi energy.
e, Effect of quantum spin Hall order parameter on the Landau spectrum.
Numbers indicate the filling factors. LL 7 and LL 8 refer to the orbital indices of
the zeroth LL. ↑ (red lines) and ↓ (blue lines) label spin-split LLs. Δ_QSH is the QSH
order parameter, and K and K’ are the opposite valleys. The dotted line
demarcates the transition between gapped and free-particle regimes. f, Same
as in e but for layer antiferromagnetism. Solid (dashed) coloured lines refer to K
(K’) valleys.


and transverse (ρ_xy) resistivities of device 1 are shown in Extended Data
Fig. 3a as a function of B at n = 2.3 × 10¹² cm⁻² and D = 0. Shubnikov-de
Haas oscillations emerge at B ~ 1 T, and well-quantized ρ_xy plateaux
develop for B > 3 T manifesting the onset of the quantum Hall effect
(QHE), such that under a strong magnetic field the electronic spectrum
of 2D surface states collapses into a series of discrete Landau levels
(LLs), leading to quantized ρ_xy. In high B (see, for example, Extended
Data Fig. 3b for B = 10 T), ρ_xy(n) plateaux appear at even fractions of h/e²,
indicating a two-fold (valley) degeneracy of the LLs. To examine the
quantization, Fig. 3a shows the Landau fan diagram σ_xx(n, B) for device
1 at D = 0. We note strong electron-hole asymmetry in the diagram:
the electron side exhibits straight LLs (blue lines follow the slopes
B/n = h/eν, where ν is the filling factor) fanning out from the neutrality
point, while the hole side shows a criss-cross pattern. We observed
similar behaviour in RG devices of different thicknesses, which is consistent
with the surface origin of the LLs (Extended Data Fig. 4). The
QHE has also been reported in thin films of Bernal-stacked graphite”.
In the latter case, the quantization was due to the formation of standing
waves in the bulk. In contrast, our multilayer RG has a gapped bulk but
conducting surfaces, which should exhibit the QHE if their electronic 
quality is sufficiently high. 
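
The straight lines of the Landau fan follow from B/n = h/(eν). The sketch below evaluates this relation for an illustrative carrier density of 2.3 × 10¹² cm⁻²; the density and the choice of filling factors are example inputs, and the function names are hypothetical.

```python
H_PLANCK = 6.62607015e-34    # Planck constant, J s
E_CHARGE = 1.602176634e-19   # elementary charge, C


def field_for_filling(n_m2, nu):
    """Magnetic field (T) at which filling factor nu occurs for density n: B = n h / (e nu)."""
    return n_m2 * H_PLANCK / (E_CHARGE * nu)


def filling_factor(n_m2, b_tesla):
    """Filling factor nu = n h / (e B) for density n (m^-2) and field B (T)."""
    return n_m2 * H_PLANCK / (E_CHARGE * b_tesla)


N_EXAMPLE_M2 = 2.3e12 * 1e4  # illustrative density: 2.3e12 cm^-2 converted to m^-2

fan = {nu: field_for_filling(N_EXAMPLE_M2, nu) for nu in (8, 12, 16)}

if __name__ == "__main__":
    for nu, b in fan.items():
        print(f"nu = {nu:2d} is reached at B = {b:.2f} T")
    print(f"filling factor at 10 T: {filling_factor(N_EXAMPLE_M2, 10.0):.2f}")
```

Each filling factor defines one ray of the fan, and higher filling factors appear at proportionally lower fields for the same density.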

In N-layer RG, a Berry phase ±Nπ around the opposite K points leads to
N-degenerate zero-energy LLs for each spin and valley, which are localized
on the opposite surfaces for opposite valleys”. A free-particle
spectrum of LLs in RG calculated using the same method as in ref. *° is
plotted in Extended Data Fig. 4a. Typically for semimetals, conduction-
and valence-band LLs make numerous crossings at moderate B, which
for multilayer RG happens on the hole side of the spectrum. In the
absence of a displacement field, all LLs are valley degenerate. In addition,
low-index valence-band LLs are triply degenerate, reflecting the
triply degenerate maxima of the valence band (Fig. 1b). They are split
at higher B where the three Fermi surfaces of the valence band merge
into one. For device 1 (9 layers) in Fig. 3a, the total number of the zeroth
LLs is 36 (2 spins, 2 valleys and 9 orbitals), and the lines corresponding
to filling factors up to ±18 come from these zeroth LLs. Lines on the
electron side, which trace ν = 8, 12, 16,... are well pronounced. These
filling factors in multiples of 4 can be explained by relatively large Zeeman
gaps between two-fold degenerate orbital levels of zeroth LLs. A
pronounced gap at ν = −18 followed by a series of crossings below 8 T
could be traced back to the crossings between the 2N-fold degenerate
zeroth LL and the valence-band LLs. The robust ν = −18 state emerges
when these crossings stop (Extended Data Fig. 4a). When only a single
gate is used (Extended Data Fig. 5), the valley degeneracy becomes
lifted, and a robust gap appears at ν = −7 (for 7-layer-thick device 7),
ν = −9 (for 9-layer device 1) and at ν = −11 (for 11-layer device 6). This
provides additional support for our assignment of the observed spectrum
to the N-degenerate zeroth LL, which is in turn a manifestation of
an Nπ Berry phase in N-layer RG.
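
The counting argument above can be written out explicitly. This sketch tallies the zeroth Landau levels for an N-layer film (2 spins × 2 valleys × N orbitals), the range of filling factors they span, and the gap position expected when the valley degeneracy is lifted by single gating; the function names are illustrative.

```python
def zeroth_ll_count(n_layers, spins=2, valleys=2):
    """Total number of zeroth Landau levels: spins x valleys x N orbitals."""
    return spins * valleys * n_layers


def zeroth_ll_filling_range(n_layers):
    """Filling factors spanned by the zeroth LLs, symmetric about neutrality."""
    half = zeroth_ll_count(n_layers) // 2
    return -half, half


def single_gate_gap_filling(n_layers):
    """Gap position quoted in the text when only one gate is used: nu = -N."""
    return -n_layers


count_device_1 = zeroth_ll_count(9)
range_device_1 = zeroth_ll_filling_range(9)
# Layer counts quoted in the text for devices 7, 1 and 6.
single_gate_gaps = {n: single_gate_gap_filling(n) for n in (7, 9, 11)}

if __name__ == "__main__":
    print(f"9 layers: {count_device_1} zeroth LLs, filling factors {range_device_1}")
    print(f"single-gate gaps: {single_gate_gaps}")
```

For the 9-layer device this reproduces the 36 zeroth LLs and the ±18 filling-factor range quoted above, and the single-gate gap positions track the layer number directly.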

Although the free-particle theory adequately describes the experimental
results at high doping (Fig. 3a, d above |n| > 2.5 × 10¹² cm⁻²),
it fails to explain the pronounced gaps observed at ν = 0 and ν = −6
(Fig. 3a). Apart from the criss-cross pattern on the hole side (Fig. 3b),
which agrees with the free-particle theory, there is a series of crossing
points on the electron side at the symmetry broken states ν = 10, 14,
18, 22... The crossings are better seen if we replot the Landau fan as a
function of ν (Fig. 3c; the crossing points are marked with red arrows).
These crossings also cannot be explained using the free-particle picture.
To account for the latter features, we suggest a spontaneous appearance
of an interaction-induced spin and/or valley symmetry-breaking
order parameter. It leads to a bandgap when the resulting band splitting
exceeds the single-particle overlap between the conduction and
valence bands, thus allowing the observation of quantum Hall states at
ν = 0 and −6 (Fig. 3a, d). With increasing B, the gap switches from ν = 0
to ν = −6. This can be explained by large orbital magnetic moments
that have opposite signs for opposite spins (or valleys) and lead to
closure of the ν = 0 gap, as shown in Fig. 3d. The bandgap disappears
with increasing doping, but the observed crossings on the electron
side (Fig. 3c, and also Extended Data Fig. 4c, d for 3.3-nm-thick device
5) suggest that a small interaction-induced order parameter remains
present up to n = 2.5 × 10¹² cm⁻².

The interplay between the orbital, valley and spin degrees of freedom 
leads to several possible states with spontaneously broken symmetry in 
multilayer RG. They include quantum valley Hall, quantum anomalous 
Hall, layer antiferromagnetic insulator and quantum spin Hall states? 
(see Methods). The crossing points on the electron side (Fig. 3c, and 
Extended Data Fig. 4d for similar results in device 5) can be explained 
by the phase transition from spin-polarized states due to Zeeman splitting
at higher electron concentration to quantum spin Hall states at
electron concentration n < 2.5 × 10¹² cm⁻² (Fig. 3e). The quantum spin
Hall phase is accompanied by a giant Berry curvature and a related giant 
orbital magnetic moment (the same in both valleys but opposite for 
the two spin directions)’. The observed crossings cannot be explained 
by any other of the conjectured phases listed above. For instance, the 

Fig. 4 | Hysteretic behaviour of the insulating state in multilayer RG.
a, ρ_xx(n) of dual-gated device 5 (RG about 3.3 nm thick) at D = 0; data were taken at
three different temperatures (top, 0.25 K; middle, 4 K; bottom, 10 K). The solid
(dashed) curves indicate positive (negative) sweep directions; the coloured
areas highlight the hysteresis. b, Hysteresis in B_⊥ of the same device used in a.
Panels show ρ_xx(B_⊥); the data were taken at the same three temperatures as in a,
with n values shown at the top right of each panel. Coloured arrows show sweep
directions. c, Hysteresis in B_∥ for device 6 (single-gated RG about 3.6 nm thick).
ρ_xx(B_∥) is shown for six different temperatures; the coloured areas highlight the
hysteresis. d, Phase diagram of the critical behaviour in c: the temperature at
which the insulating state appears depends on B_∥ (error bars are due to
hysteresis). The red curve is the fit T = (B_0 − B_∥)^α, where B_0 = 4 T is the critical field
at T = 0 K, and α = 0.4 is the critical exponent.


layer antiferromagnetic phase leads to lifted valley degeneracy and 
the presence of gaps and crossings at both even and odd filling fac- 
tors (Fig. 3f), which contradicts the experiment. Note that there is no 
evidence that the observed quantum spin Hall phase becomes stronger 
towards zero doping, because the development of another competing 
order cannot be excluded. 

In the absence of a magnetic field, thin (<4 nm) RG devices reveal an
intrinsic insulating state at the neutrality point (Fig. 1d, e). Extended Data
Fig. 6a shows the temperature dependence of ρ_xx(n) around the insulating
state at D = 0 in another RG device about 3.3 nm thick. At T below 10 K, the
resistance shoots up by orders of magnitude, showing thermal activation
behaviour consistent with a gap of 2-3 meV (Extended Data Fig. 6a). We
note that this value represents an effective transport gap, after accounting
for the lifted overlap of about 2γ_4γ_1/γ_0 (Extended Data Fig. 9).

Surprisingly, pronounced hysteretic behaviour is observed in the 
resistivity of multilayer RG around the neutrality point as a function 
of doping (see Fig. 4a and Extended Data Fig. 6a, b). The hysteresis 
rapidly decreases with increasing T and disappears if the RG enters a 
metallic state. Note that the hysteresis strongly favours electron doping, 
reflecting the electron-hole asymmetry of the RG band structure. 
In addition to the hysteresis as a function of n, we also observed 
pronounced hysteresis in response to both perpendicular (B_⊥) and parallel 
(B_∥) magnetic fields. B_⊥ suppresses the insulating state (Fig. 4b) but 
the accompanying Landau quantization unfortunately complicates 
analysis of the observed T dependence. In contrast, measurements in 
B_∥ (Fig. 4c), where the orbital degree of freedom remains unaffected, 
provide a clearer picture. With increasing T, both the hysteresis and the 
transition field rapidly decrease, and the B_∥–T phase diagram follows the 
power law T = (B_0 − B_∥)^α, where α = 0.4 is the critical exponent (Fig. 4d). 
Our fit is comparable to those reported for the ‘hidden order’ phase 
in strongly correlated heavy fermion systems like URu₂Si₂. The 
pronounced hysteretic behaviour with abrupt switching hints at spontaneous 
formation of insulating electronic phases that are separated into 
domains, similar to those observed in other correlated systems such as 
manganites, cuprates, and transition metal oxides. The hysteresis as 
a function of both electric and magnetic fields is characteristic of 
multiferroic materials, and advanced scanning probe techniques could be 
helpful in identifying the nature of those electronic domains in future. 
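
The power-law phase boundary quoted above can be tabulated with a few lines of Python. This is only an illustrative sketch: it takes B_0 = 4 T and α = 0.4 from the fit in Fig. 4d, and it assumes a unit prefactor because the text does not give one, so the returned temperatures are in the fit's own units. The function name is hypothetical.

```python
def transition_temperature(b_parallel, b0=4.0, alpha=0.4):
    """Phase-boundary temperature T = (B0 - B_par)**alpha.

    b0 (tesla) and alpha are the fit values quoted in the text. The overall
    prefactor is not given there, so it is ASSUMED to be 1 in this sketch.
    """
    if b_parallel >= b0:
        return 0.0  # at or above the critical field the insulating state is absent
    return (b0 - b_parallel) ** alpha


# Tabulate the boundary for a few in-plane fields below B0.
for b in (0.0, 1.0, 2.0, 3.0, 3.9):
    print(f"B_par = {b:.1f} T -> T = {transition_temperature(b):.3f}")
```

With an exponent below 1, the boundary drops steeply as B_∥ approaches B_0, which is the behaviour sketched by the red curve in Fig. 4d.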

The present work offers a novel experimental system in which to 
explore strongly correlated phenomena that does not rely on spin-orbit 
or exchange interactions in transition metal oxides or on precise 
structuring of graphene superlattices. Electronic coupling between 
top and bottom surfaces in thin RG films is similar to that observed in 
thin films of topological insulators and could be responsible for the 
spontaneous gap opening in our case. However, this coupling cannot 
explain the observed hysteretic behaviour. We believe that multilayer 
RG provides an interesting system in which to explore strong correla- 
tions, quantum criticality and other many-body phenomena, usually 
reserved to f-element-based heavy fermions or d-element-based transi- 
tion metal oxides. 
Methods 


Device fabrication 

The RG films were obtained by mechanical exfoliation of natural graphite 
flakes (Graphenium Flakes 25-30 mm, NGS Naturgraphit) onto a 
SiO₂ (290 nm)/Si wafer. Exfoliated graphite films often show domains 
of various stacking orders, which can be identified by Raman spectroscopy. 
By mapping the ratio of the integrated area of the low-frequency 
component (ranging around 2,670–2,700 cm⁻¹) and the high-frequency 
component (ranging around 2,700–2,730 cm⁻¹) of the Raman 2D band, 
we can visualize the distribution of rhombohedral stacking in the films. 
Raman spatial maps were taken with a step size of 0.5 μm. 
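
The area ratio used for the Raman maps can be illustrated with a short Python sketch. It is not the authors' analysis code: the function names are hypothetical, the spectrum is assumed to be given as plain lists of Raman shifts (cm⁻¹) and counts, and background subtraction is assumed to have been done beforehand. Only the two integration windows come from the text.

```python
def band_area(shifts, counts, lo, hi):
    """Trapezoidal area of the spectrum over sample intervals lying inside [lo, hi]."""
    area = 0.0
    for k in range(len(shifts) - 1):
        x0, x1 = shifts[k], shifts[k + 1]
        if lo <= x0 and x1 <= hi:
            area += 0.5 * (counts[k] + counts[k + 1]) * (x1 - x0)
    return area


def stacking_ratio(shifts, counts):
    """Ratio of low-frequency to high-frequency integrated area of the 2D band."""
    low = band_area(shifts, counts, 2670.0, 2700.0)   # window quoted in the text
    high = band_area(shifts, counts, 2700.0, 2730.0)  # window quoted in the text
    return low / high


# Toy spectrum (made-up counts) with extra weight on the low-frequency side.
shifts = list(range(2670, 2731, 10))
counts = [3.0, 3.0, 3.0, 3.0, 1.0, 1.0, 1.0]
print(stacking_ratio(shifts, counts))
```

Evaluating `stacking_ratio` at every pixel of a spatial scan would give a map of the kind described above. Which ratio values correspond to rhombohedral stacking has to be calibrated against reference spectra.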

RG films often revert to Bernal stacking during the flake transfer 
process, and only a small region of the flake remains in the original 
stacking order. The transition between rhombohedral stacking and 
Bernal stacking occurs during the shift of graphene layers along 
armchair directions, whereas it does not happen along zig-zag directions. 
To retain the high electronic quality of RG, we used a directional van der 
Waals assembly technique as follows. First, we identified the crystal 
orientation of graphite from the edge chirality of graphite films using 
Raman spectroscopy. Next, we used a polymer-mediated dry transfer 
method (with a polymethyl methacrylate (PMMA)/polydimethylsiloxane 
(PDMS) stack mounted on a glass slide) to encapsulate RG films 
in hexagonal boron nitride (hBN). During the encapsulation process, 
the pressing-down direction of the PDMS/PMMA layers was controlled 
along the zig-zag direction of the graphite films, so that the stacking 
transformation could be avoided. Then, we used the above-mentioned 
Raman-ratio map again to confirm that rhombohedral stacking was 
retained in the hBN-encapsulated graphite. Finally, multiterminal Hall 
bar devices were made from the hBN/RG films/hBN heterostructures 
using standard electron-beam lithography. Examples of our Raman 
analysis during device fabrication are presented in Extended Data Fig. 7 
for device 6 (3.6-nm-thick RG). 


Surface and bulk contributions 

Extended Data Figure 8 shows the temperature dependence of resistivity, 
ρ_xx(T), at n = 0 for 16.5- and 3.3-nm-thick RG (devices 4 and 5, 
respectively). In contrast to the metallic behaviour of Bernal-stacked 
graphite, we observed a semiconductor-to-metal transition with 
decreasing temperature. We attribute this transition to the interplay 
between three parallel conduction channels: the bulk and the two 
surfaces. Infinitely thick RG has 3D Dirac cones in its bulk energy spectrum, 
but in films of a finite thickness (with N layers), these Dirac cones are 
gapped by Δ = 2.1 eV/N (see below). At high temperatures, the RG 
bulk conducts as a result of thermal activation across this gap, and 
this conductivity dominates over the 2D surface states. With decreasing 
T, the bulk contribution decreases and, finally, below some critical 
temperature T_c (about 20 K and 40 K for 16.5-nm and 3.3-nm RG, 
respectively), the contribution from the surfaces dominates, and ρ_xx(T) 
shows metallic behaviour. This temperature behaviour of resistivity 
is similar to that observed in 3D topological insulators, which is 
also ascribed to competing conductances of bulk and surface states. 
The observed T_c is smaller for thicker samples, in agreement with the 
expected 1/N dependence of the bulk gap. The observed T dependence 
indicates that at low temperatures (T < T_c), the conductivity through 
3D bulk bands is suppressed and mostly 2D surface states contribute 
to electron transport. Thermal activation within the RG bulk limits the 
interval of exponential T dependence, preventing us from evaluating 
the bulk gaps at high D. 
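
As a rough numerical illustration of the 1/N scaling discussed above, the sketch below converts a film thickness into a layer count and evaluates the quoted bulk gap of 2.1 eV/N. It assumes the interlayer distance d_0 = 0.335 nm given later in Methods and simple rounding to the nearest integer; the function names are hypothetical.

```python
D0_NM = 0.335  # interlayer distance quoted in Methods (nm)


def layers_from_thickness(thickness_nm):
    """Estimate the number of graphene layers N from the film thickness."""
    return round(thickness_nm / D0_NM)


def bulk_gap_ev(n_layers):
    """Bulk gap of an N-layer film, using the 2.1 eV / N estimate from the text."""
    return 2.1 / n_layers


for t_nm in (16.5, 3.3):
    n = layers_from_thickness(t_nm)
    print(f"{t_nm} nm -> N = {n}, bulk gap = {1e3 * bulk_gap_ev(n):.0f} meV")
```

The thicker film has the smaller bulk gap, in line with its lower crossover temperature T_c.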


Tight-binding model 

To describe multilayer RG, we employ the Slonczewski–Weiss–McClure 
(SWMC) parameterization of graphite, with interlayer hopping 
parameters that take into account the lattice arrangement of 
rhombohedral graphite, as shown in Fig. 1a. Multilayer RG has bulk subbands 
shifted away from zero energy, leading to a bulk gap of about 3πγ_1/N 
for N layers, making only the surface bands relevant for electronic 
transport at low T. The surface states have shallow dispersion for 
quasi-momenta p < p_c = γ_1/v, where v is the Dirac velocity determined 
by intralayer nearest-neighbour hopping γ_0. There is also a quadratic 
term in the dispersion of surface states, p²/(2m*), with 
1/m* = (2v²/γ_1)(2γ_4/γ_0 + δ/γ_1) = (0.4 m_e)⁻¹, which breaks the electron-hole 
symmetry, and trigonal warping terms controlled by γ_2 and γ_3. Here m_e 
is the free electron mass. Since the SWMC parameters are poorly known 
for RG, we use the conventional HG values for the larger hopping 
amplitudes (γ_0 = 3.16 eV, γ_1 = 0.39 eV, γ_3 = 0.25 eV) while tuning the 
unknown γ_2 to −0.01 eV and m* to 0.4 m_e to better describe the 
experiment. We also set the energy difference of dimerized (that carry bulk 
states) and non-dimerized (that carry surface states) sublattices to 
zero, δ = 0, and tuned γ_4 to about 0.13 eV. These parameter values have 
been chosen by comparing Fig. 3a with Extended Data Fig. 4a and matching 
the electron density of the first crossing of valence and conduction 
bands (n = −6.3 × 10¹² cm⁻²), which corresponds to the valence-band 
population where the chemical potential is at the bottom 
of the conduction band. Other crossings lead to extra constraints that 
can be satisfied reasonably well at positive filling factors (4 < ν < 40, 
where crossings are in the range 5 T > B > 2 T), while the matching 
becomes worse near zero filling and at larger magnetic fields, 
suggesting an increased role of interaction effects. 
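
The effective-mass relation above can be checked numerically. The sketch below assumes the form 1/m* = (2v²/γ_1)(2γ_4/γ_0 + δ/γ_1) with γ_4 ≈ 0.13 eV and δ = 0, and the standard nearest-neighbour expression v = (√3/2)aγ_0/ħ. The graphene lattice constant a = 0.246 nm is an assumption of this sketch and is not stated in the text, and the function names are hypothetical.

```python
import math

HBAR = 1.054571817e-34       # J s
E_CHARGE = 1.602176634e-19   # C (numerically also joules per eV)
M_E = 9.1093837015e-31       # kg, free electron mass
A_LATTICE = 0.246e-9         # m, graphene lattice constant (ASSUMED, not from the text)


def dirac_velocity(gamma0_ev):
    """Dirac velocity v = (sqrt(3)/2) * a * gamma0 / hbar, in m/s."""
    return math.sqrt(3) / 2 * A_LATTICE * gamma0_ev * E_CHARGE / HBAR


def effective_mass_ratio(gamma0=3.16, gamma1=0.39, gamma4=0.13, delta=0.0):
    """m*/m_e from 1/m* = (2 v^2 / gamma1) * (2 gamma4 / gamma0 + delta / gamma1)."""
    v = dirac_velocity(gamma0)
    inv_mass = (2 * v ** 2 / (gamma1 * E_CHARGE)) * (2 * gamma4 / gamma0 + delta / gamma1)
    return 1.0 / (inv_mass * M_E)


print(f"v = {dirac_velocity(3.16):.3e} m/s")
print(f"m*/m_e = {effective_mass_ratio():.2f}")
```

Under these assumptions the result is close to the value m* = 0.4 m_e used in the fit, which suggests that the quoted γ_4 and m* are mutually consistent.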


Bandgap induced by displacement field 

Introducing a vertical electric displacement field D via asymmetric 
gating leads to a potential difference Δ between the top and the bottom 
layers, resulting in a density redistribution that strongly screens the 
external electric field. The dispersion of surface states with non-zero 
Δ is shown in Extended Data Fig. 9. Note that the difference between 
the energies of valence- and conduction-band states at the K point 
equals Δ (only γ_1 vertical hopping and the Δ potential contribute to the 
Hamiltonian at K points, leading to states localized at A_1 and B_N 
sublattices with energies ±Δ/2, while all the other states belong to dimers, 
connected by γ_1 vertical hopping and having energies ±γ_1). After 
calculating the energy dispersion, the screening electron density 
accumulated at layer i is calculated as 


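$$ n_i = 2\int_{\mathrm{BZ}} \frac{d^2k}{(2\pi)^2} \left[ \sum_l \left( |\psi^l_{A_i}(\mathbf{k})|^2 + |\psi^l_{B_i}(\mathbf{k})|^2 \right) f(\varepsilon_l - E_F) - 1 \right], $$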
where i is the layer number in N-layer RG, the integration is over the 
Brillouin zone (BZ), l stands for band index, ψ^l_{A_i}(k) and ψ^l_{B_i}(k) are 
normalized wavefunctions of two sublattices in layer i, f is the Fermi 
function, E_F is the Fermi level, ε_l is the energy of band l, and −1 subtracts 
a homogeneous charge jelly that neutralizes the Fermi sea electron 
density. Denoting the total screening charge accumulated near the 
top and bottom surfaces by n_t^scr(Δ) and n_b^scr(Δ) respectively, we relate 
the displacement field to the charge density and the potential 
asymmetry as 


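$$ -\frac{e\,[n_t - n_b]}{2\varepsilon_0} = D = \frac{e\,[n_t^{\mathrm{scr}}(\Delta) - n_b^{\mathrm{scr}}(\Delta)]}{2\varepsilon_0} + \frac{\Delta}{e d}, $$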


where e is the electron charge, d is the thickness of the graphite film, and 
ε_0 is the vacuum permittivity. We employed the Hartree approximation 
and used the fact that the dominant screening charge is localized at the 
surface layers. By varying the potential difference of the surface layers, 
tuning the chemical potential to maintain overall charge neutrality, and 
finding the electron density distribution, we determined the relation 
between the bandgap and applied external displacement field, as shown 
in Fig. 2f inset. The bandgap Δ̃, defined as the difference between the 
conduction-band minimum and the valence-band maximum, is shown 
in Extended Data Fig. 9. 
In the absence of displacement field D, the conduction and valence 
bands of RG overlap. Therefore, one has to apply a field D larger than 
some critical D_c, so that the potential asymmetry exceeds the 
bandwidth of overlapping and drives the system from a semimetallic to a 
semiconducting phase (Extended Data Fig. 9). Calculated values of D_c 
are shown in Fig. 2e and compared to experimental results. When the 
total electron concentration is zero, the Fermi level is in the 
gap, resulting in the experimentally observed transport gap presented 
in the main text. 
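
The competition between the field-induced asymmetry Δ and the band overlap can be put into numbers with the relation from the Extended Data Fig. 9 caption, Δ̃ = Δ − 2γ_1γ_4/γ_0. The sketch below uses the hopping values quoted in the tight-binding section and treats a negative Δ̃ as "no gap" (semimetal). That clipping is an interpretation added here, and the function names are hypothetical.

```python
def band_overlap_ev(gamma0=3.16, gamma1=0.39, gamma4=0.13):
    """Overlap of valence and conduction bands, 2*gamma1*gamma4/gamma0, in eV."""
    return 2 * gamma1 * gamma4 / gamma0


def effective_gap_ev(delta_ev):
    """Effective transport gap: Delta minus the overlap, clipped at zero (semimetal)."""
    return max(0.0, delta_ev - band_overlap_ev())


print(f"overlap = {1e3 * band_overlap_ev():.1f} meV")
for delta in (0.02, 0.04, 0.06):
    print(f"Delta = {1e3 * delta:.0f} meV -> effective gap = {1e3 * effective_gap_ev(delta):.1f} meV")
```

In this picture the critical field D_c is the field at which Δ first exceeds the overlap. Converting Δ into D requires the self-consistent screening calculation described above, which this sketch does not attempt.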


Possibility of stacking faults 
It is important to have a reliable method to check for possible stacking 
faults in the sample. Apart from Raman (Extended Data Fig. 1a–d) and 
electronic Raman scattering signatures, measurements of an in-plane 
transport gap opened by the vertical displacement field (asymmetric 
gating) provide another diagnostic tool (Extended Data Fig. 1e–g). 
Our calculations reveal that it is impossible to open a transport gap if 
stacking faults or thicker inclusions of ABA stacking are present. We 
considered the most elementary ABCBA stacking fault that joins ABC 
stacking with its mirror-image CBA stacking (Extended Data Fig. 2a); 
a Bernal stacking fault that joins two ABC multilayers, as shown in 
Extended Data Fig. 2d for the ABCBCA multilayer; and surface 
stacking fault ABCAC (Extended Data Fig. 2g). The bold characters indicate 
the type and position of stacking faults. 

At the K points of the Brillouin zone of RG, the in-plane hoppings γ_0 
destructively interfere, leaving only γ_1 as a high-energy scale. For the 
twin-boundary stacking, the low-energy states are localized on 
sublattices A_1, A_5 and B_3, and on a zero-energy eigenstate 
(ψ_{B_2}, ψ_{A_3}, ψ_{B_4}) = (1, 0, −1)/√2 
of a B_2A_3B_4 trimer, which we denote as B_2 − B_4 (Extended Data Fig. 2a). 
Away from the K point, chiral decomposition leads to the appearance 
of effective 2- and 3-layer ABC chiral subsystems with obvious 
generalization to films thicker than 5 graphene layers. So, there are four 
low-energy sub-bands: two with energies growing away from the K 
point (conduction-band type) and two with energies decreasing away 
from the K point (valence-band type). Similarly to RG discussed before, 
the hopping γ_4 leads to an overlap of valence and conduction sub-bands 
of the order of 20 meV. Application of the vertical electric field 
(creating layer potentials U_1 < U_2 < ... < U_5) splits the four almost degenerate 
sub-bands into two outer valence and conduction bands, localized on 
A_1 and A_5 (with energies U_1 and U_5 at the K point), and two inner valence 
and conduction bands, localized on B_3 and B_2 − B_4 (with energies U_3 and 
(U_2 + U_4)/2 at the K point). The energy splitting of the two inner bands 
is U_3 − (U_2 + U_4)/2 = e²d_0n_3/(2ε_0), where d_0 = 0.335 nm is the distance 
between layers, and it only depends on the charge density of the third layer, n_3. 
For the ABCBCA stacking fault in Extended Data Fig. 2d, the 
low-energy states at the K point are localized on sublattices A_1, B_3, A_4 
and B_6, with the splitting of the inner band determined by the electric 
field between layers 3 and 4. In both ABCBA and ABCBCA stacking fault 
cases, to open a gap one requires a displacement field sufficient to 
overcome the doubled screening of four low-energy bands and, 
additionally, to create a potential difference of about 20 meV between the 
two consecutive layers forming a stacking fault. Such displacement 
fields would strongly exceed the values plotted in Fig. 2e, and are 
beyond our experimental reach. Large but experimentally attainable 
displacement fields would split the inner B_3 and A_4 bands at the K point, 
as shown in Extended Data Fig. 2e, f, but such high fields would also 
substantially increase the dispersion of these bands, resulting in band 
overlap near p = p_c, thus closing the gap. As for the ABA surface 
stacking fault in Extended Data Fig. 2g–i, the ABA surface bands overlap with 
a Dirac cone having velocity v/√2, which shunts the gapped state. 
Thus we conclude that any stacking faults make opening of a 
transport gap impossible. We also note that stacking faults would, 
generically, lead to asymmetric behaviour with respect to reversal of the 
direction of D. Calculated dispersions with non-zero displacement field 
are shown for the twin-boundary defect in Extended Data Fig. 2b, c; for 
the stacking fault buried inside the ABC bulk in Extended Data Fig. 2e, f; 
and for the surface stacking fault in Extended Data Fig. 2h, i. Examples 
of transport data for multilayer graphite devices with stacking faults 
are shown in Extended Data Fig. 1. The presence of faults is established 
by measuring Raman spectra. 
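
To get a feeling for the scale of the inner-band splitting at a twin boundary, the sketch below evaluates the electrostatic relation U_3 − (U_2 + U_4)/2 = e²d_0n_3/(2ε_0) with d_0 = 0.335 nm. It assumes the relation in exactly this form and reports the result in eV (that is, divided by e). The function names and the example density are hypothetical.

```python
E_CHARGE = 1.602176634e-19   # C
EPS0 = 8.8541878128e-12      # F/m
D0 = 0.335e-9                # m, interlayer distance quoted in the text


def inner_band_splitting_ev(n3_cm2):
    """Splitting e^2 * d0 * n3 / (2 * eps0), returned in eV, for n3 given in cm^-2."""
    n3_m2 = n3_cm2 * 1e4                        # convert cm^-2 to m^-2
    return E_CHARGE * D0 * n3_m2 / (2 * EPS0)   # energy in joules divided by e


def density_for_splitting_cm2(splitting_ev):
    """Third-layer charge density (cm^-2) needed to reach a given splitting in eV."""
    return splitting_ev * 2 * EPS0 / (E_CHARGE * D0) / 1e4


print(f"n3 = 1e12 cm^-2 -> splitting = {1e3 * inner_band_splitting_ev(1e12):.1f} meV")
print(f"20 meV needs n3 = {density_for_splitting_cm2(0.02):.2e} cm^-2")
```

The second function inverts the first, giving the charge density that would have to sit on the fault layer to reach the roughly 20 meV potential difference mentioned above.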


Landau levels in multilayer RG 

We have calculated the LL spectrum by employing the same method 
as in ref. 7°. Numerical diagonalization is performed in a regularized 
finite basis consisting of oscillator states φ_0, φ_1, ... for each sublattice 
component, with a sublattice-dependent truncation set by N_0 = 250, 
a cut-off LL index that is sufficient for convergence 
at B > 0.5 T. 

To qualitatively understand the LL spectrum, let us start with the 
minimal model with only the nearest-neighbour in-plane (γ_0) and 
interlayer (γ_1) hoppings. In the absence of a displacement field (D = 0), 
the low-energy LL spectrum consists of 4N-degenerate levels at zero 
energy, which we refer to as zeroth LLs (where 4 is due to double-spin 
and double-valley degeneracy, and N comes from N-fold orbital 
degeneracy), plus four-fold degenerate levels arising from the conduction 
and valence bands that disperse as E^± ∝ ±B^{N/2}. Within the two-band 
model, the vector wavefunctions of zeroth LLs of one valley reside on 
one (for example, the top) surface, ψ_{K,i} = (φ_i, 0)^T, i = 0, ..., N−1, while for 
the other valley the wavefunctions sit on the opposite (for example, 
the bottom) surface, ψ_{K′,i} = (0, φ_i)^T. Therefore, for zeroth LLs the valley 
degeneracy is equivalent to the top/bottom surface degeneracy. 

The minimal model is insufficient to describe the experimental 
results, making it necessary to consider further electron hopping 
amplitudes. The presence of inter-layer same-sublattice hopping γ_4 
leads to strong electron-hole asymmetry and breaks the orbital 
degeneracy of zeroth LLs, leading to dispersion E_n^0 = nħω_c, n = 0, ..., N−1, 
with ω_c = eB/m*, m* = 0.4 m_e. The conduction-band LLs continue the 
sequence of four-fold degenerate LLs as E_n^c = nħω_c, n = N, N+1, .... The 
valence-band dispersion is non-monotonic and produces a series of 
four-fold degenerate LLs E_n^v = nħω_c, n = 1, 2, ... near zero energy, as well 
as a sequence of 12-fold degenerate levels at higher energies, 
originating from the three equivalent valence-band maxima caused by trigonal 
warping (hoppings γ_3, γ_2). 
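
The orbital splitting of the zeroth LLs is easy to evaluate from ω_c = eB/m* with m* = 0.4 m_e. The sketch below is a minimal illustration of the ladder E_n = nħω_c for n = 0, ..., N−1. It ignores Zeeman splitting, trigonal warping and interactions, and the function names are hypothetical.

```python
HBAR_E_OVER_ME = 1.157676e-4  # eV per tesla: hbar * e / m_e (twice the Bohr magneton)


def cyclotron_energy_mev(b_tesla, mass_ratio=0.4):
    """hbar * omega_c in meV for an effective mass m* = mass_ratio * m_e."""
    return 1e3 * HBAR_E_OVER_ME * b_tesla / mass_ratio


def zeroth_ll_energies_mev(n_layers, b_tesla):
    """Orbitally split zeroth LLs E_n = n * hbar * omega_c, n = 0 .. N-1, in meV."""
    hw = cyclotron_energy_mev(b_tesla)
    return [n * hw for n in range(n_layers)]


print(f"hbar*omega_c at 10 T: {cyclotron_energy_mev(10.0):.2f} meV")
print([round(e, 2) for e in zeroth_ll_energies_mev(9, 10.0)])
```

For a 9-layer film this gives nine equally spaced levels. In the model above, each of them still carries spin and valley degeneracy.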

At moderate B, valence-band LLs bend down, forming numerous 
crossings with zeroth LLs and conduction-band LLs. This explains the 
multiple crossings of the LLs on the hole side in Fig. 3a, b. The two-fold 
degeneracy of the filling factors (for example, ν = 6, 8, 10...) at D = 0 
can be attributed to the preserved valley degeneracy, while the gaps 
are given by combination of the orbital splitting (ν = 6, 10, 14... for odd 
N and ν = 4, 8, 12, 16... for even N) and spin splitting (ν = 4, 8, 12, 16... 
for odd N and ν = 6, 10, 14... for even N). Although the theory 
qualitatively agrees with the experiment, it predicts a value of B at which 
the last crossings occur that is twice as high. This discrepancy can be 
attributed to exchange interactions. 

The filling factor at a specified chemical potential is determined as 
the number of filled LLs in the regularized basis, with half the total 
number of levels subtracted to account for the background charge 
neutralizing the Fermi sea. Normally, this would place the zero electron 
density between LLs originating from valence and conduction bands 
that form the pairs related by approximate electron-hole symmetry 
(ψ_A, ψ_B) → (ψ_B, −ψ_A). However, there are N additional (spin and valley 
degenerate) zero LLs (numbered as 0, ..., N−1) with wavefunctions 
localized only on one of the sublattices. At low B, zero LLs overlap with 
valence-band LLs, leading to complicated counting of the filling factors 
but, for B ≥ 10 T, conventional valence-band LLs are below the zero 
energy, leading to zero electron density (ν = 0) located in the middle 
point of zeroth LLs. This middle point is located either in between the 
Zeeman-split zeroth LLs with index (N−1)/2 for odd N, or between the 
orbital-split zeroth LLs with indices N/2 − 1 and N/2 for even N. 
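
The counting rule described above (filled levels minus half of all levels in the basis) can be written down directly. The sketch below is a toy version: it assumes the caller supplies a list with one energy per level, with any spin or valley copies listed explicitly or folded into the optional `degeneracy` factor. The function name and the example levels are hypothetical.

```python
def filling_factor(level_energies, mu, degeneracy=1):
    """Filled levels below mu minus half of all levels, times an optional degeneracy."""
    filled = sum(1 for energy in level_energies if energy < mu)
    return degeneracy * (filled - len(level_energies) / 2)


# Toy spectrum (arbitrary units): two levels below and two above zero energy.
levels = [-2.0, -1.0, 1.0, 2.0]
for mu in (-1.5, 0.0, 1.5):
    print(f"mu = {mu:+.1f} -> nu = {filling_factor(levels, mu):+.0f}")
```

In a symmetric toy spectrum the subtraction places ν = 0 in the central gap. In the real spectrum, the N extra zeroth LLs move that neutrality point into the middle of the zeroth-LL group, as described above.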


Spin- and valley-polarized states 
Coulomb and phonon-mediated interactions between electrons can 
lead to a multitude of patterns for spontaneous symmetry breaking. 
These include breaking a ‘top-bottom’ symmetry of the two surfaces, 
a spin/valley symmetry, a six-fold rotation symmetry (nematic phase), 
a lattice translational symmetry (charge density wave), or a U(1) gauge 
symmetry (superconductivity). The presented experiments provide 
evidence that symmetry breaking occurs for a limited range of sample 
thicknesses (<10 layers), hinting at the importance of interaction 
between the surfaces, which suggests a ‘top-bottom’ type of 
symmetry breaking order. The candidates are thus the quantum valley Hall 
(VH; ferroelectric) phase with order parameter Δ^VH = Δ σ_z^layer, quantum 
layer antiferromagnetism (LAF) Δ^LAF = Δ σ_z^layer ⊗ σ_z^spin, the quantum 
spin Hall (QSH) phase (also called the ‘spin flux’ phase) 
Δ^QSH = Δ σ_z^layer ⊗ σ_z^spin ⊗ σ_z^valley, or the quantum anomalous Hall (AH) 
phase Δ^AH = Δ σ_z^layer ⊗ σ_z^valley. Here, σ_z is a Pauli matrix. In the absence 
of a B field, only the relative orientation of spins matters, while the 
presence of a B field lifts this degeneracy to fix an optimal ‘canted’ 
orientation of spins. The AH phase breaks the time-reversal 
symmetry and implies a Hall signal at B = 0, which has not been observed. The 
VH phase is the one induced by displacement field and discussed above, 
but it is not continuously connected to the observed spontaneous 
insulating phase, and, instead, competes with it. So we are left with two 
candidates: LAF and the QSH effect. Both phases lead to giant Berry 
curvatures and, consequently, to large orbital magnetic moments, 
which peak near the three degenerate maxima of the valence band. 
The orbital magnetic moment has opposite signs in the two valleys for 
Δ^LAF or for the gate-induced displacement field. In the case of Δ^QSH, the 
orbital moment becomes the same in both valleys but opposite for the 
two spin directions. The orbital moment leads to pronounced lifting 
of the 12-fold degeneracy of LLs originating from valence-band maxima, 
which results in two groups of six-fold LLs. This explains the observed 
robust gap at ν = −6 (Fig. 3a, b, d). The six unoccupied valence-band 
LLs lead to the layer-antiferromagnet (three K↑ and three K′↓ levels) 
in the case of the quantum LAF order parameter and to a ferromagnet 
(three K↑ and three K′↑) in the case of QSH ordering. Canting effects 
can be neglected since the magnitude of the order parameter is much 
larger than the Zeeman splitting. 

The effect of order parameters on the spectrum of zeroth LLs 
amounts to valley-/spin-dependent shifts for LAF/QSH phases, which 
follows from localization of zeroth LLs on the top/bottom surfaces in 
the K/K′ valleys. Hence, Δ^LAF breaks the valley degeneracy, leading to 
the appearance of gaps at odd filling factors (Fig. 3f) whereas the valley 
degeneracy remains intact for Δ^QSH (Fig. 3e). The experimental data in 
Fig. 3c are consistent with the QSH order for electron doping below 
2.5 × 10¹² cm⁻², where the orbital and spin magnetic moments align. 
Note that a QSH order parameter fits finite magnetic field data at 
substantial electron doping, and does not necessarily connect to the 
observed spontaneous gap near charge neutrality. 
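
The statement that the LAF order breaks the valley degeneracy of the zeroth LLs while the QSH order preserves it can be checked by simple sign bookkeeping. The sketch below assumes the usual tensor-product forms of the four candidate order parameters (layer, layer ⊗ spin, layer ⊗ spin ⊗ valley, layer ⊗ valley) and uses the locking of zeroth LLs to the top surface in K and the bottom surface in K′. All names are hypothetical, and only the signs of the shifts are tracked.

```python
# Sign of the mass term for each candidate order, with every index taking values +1 or -1.
ORDERS = {
    "VH": lambda layer, spin, valley: layer,
    "LAF": lambda layer, spin, valley: layer * spin,
    "QSH": lambda layer, spin, valley: layer * spin * valley,
    "AH": lambda layer, spin, valley: layer * valley,
}


def zeroth_ll_shift_sign(order, spin, valley):
    """Shift sign for a zeroth LL, whose layer index is locked to its valley."""
    layer = valley  # ASSUMED convention: K (+1) on the top surface, K' (-1) on the bottom
    return ORDERS[order](layer, spin, valley)


def valley_degenerate(order):
    """True if K and K' zeroth LLs of the same spin are shifted identically."""
    return all(
        zeroth_ll_shift_sign(order, spin, +1) == zeroth_ll_shift_sign(order, spin, -1)
        for spin in (+1, -1)
    )


for name in ("LAF", "QSH"):
    print(name, "keeps valley degeneracy:", valley_degenerate(name))
```

For the QSH form the valley index enters twice and cancels, leaving a purely spin-dependent shift. For LAF the shift changes sign between valleys at fixed spin, which is the origin of the gaps at odd filling factors.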
Extended Data Fig. 1 | Effect of stacking sequence on the displacement-field-
induced bandgap. a, b, Raman maps of device 3, a 6.5-nm graphite flake with 
domains of differing local stacking sequence, shown before (a) and after (b) 
encapsulation. The colour coding indicates the ratio of the integrated area of 
the low-frequency component of the graphite Raman 2D band (ranging around 
2,670–2,700 cm⁻¹) to that of the high-frequency component (ranging around 
2,700–2,730 cm⁻¹). Coloured dots in a and b mark the positions where the 
Raman spectra shown in c and d respectively were taken. Scale bars, 10 μm. 
c, d, Typical 2D Raman peaks of HG (black curves A and a), RG (red curves C and 
c) and graphite of mixed ABA and ABC stacking (blue curves B and b), before (c) 
and after (d) encapsulation. Laser wavelength, 532 nm. e, ρ_xx as a function of D 
for Hall bar devices made using domains at point b (light blue) and point c (red), 
respectively, T = 1.6 K. Inset, optical micrograph of the devices before shaping 
them into the Hall bar geometry. f, g, Resistivity maps ρ_xx(n_t, n_b) of graphite with 
ABC stacking (f) and mixed stacking (g); T = 1.6 K, B = 0 T. 


Extended Data Fig. 2 | Band structures of multilayer RG with stacking 
faults. a, Schematic of twin-boundary defect. b, c, Calculated spectra for the 
ABCABCABACBACBACBA sequence at positive (b) and negative (c) D. 
d, Schematic of a buried Bernal stacking fault ABCBCA. e, f, Calculated band 
structure for such a defect: ABCABCABABCABCABCA sequence at positive 
(e) and negative (f) D. g, Schematic of surface stacking fault ABCAC. h, i, 
Spectra for the ABCABCABCABCABCABA sequence at positive (h) and 
negative (i) D. In all the calculations we used D = ±1 V nm⁻¹. See Methods section 
‘Possibility of stacking faults’ for details of nomenclature used in a, d, g. E is the 
band energy, p_x is the in-plane momentum, and p_c = γ_1/v, where v is the Dirac 
velocity. 
Extended Data Fig. 3 | The quantum Hall effect in thin RG. a, Hall resistivity 
ρ_xy (red curve) and longitudinal resistivity ρ_xx (black curve) as a function of 
magnetic field B measured at 20 mK for the same device as in main text Fig. 3 
(device 1); n = 2.3 × 10¹² cm⁻², D = 0 V nm⁻¹. b, ρ_xy and ρ_xx as a function of n for the 
same device as in a; D = 0, B = 10 T, T = 20 mK. 
Extended Data Fig. 4 | Landau levels from surface states of multilayer RG. 
a, Calculated free-particle spectrum. b, Conductivity map σ_xx(n_t, n_b) for 
3-nm-thick RG (device 1) measured at B = 9 T and T = 20 mK. Note that, for two 
independent surfaces, LLs should form sets of horizontal and vertical lines. The 
observed behaviour suggests that top and bottom surfaces of our RG devices 
are nearly independent 2D systems. c, Landau fan as a function of n and B for device 5 
(3.3-nm-thick RG); D = 0, T = 0.25 K. d, Differential resistivity dρ/dν(ν, B) map on the 
electron side (same device as in c). The red arrows indicate LL crossings. 
e, σ_xx(n_t, B) for device 8 (7.2-nm-thick RG) measured at T = 1.7 K. f, σ_xx(n, B) for 
device 4 (16.5-nm-thick RG) at D = 0 and T = 0.25 K. 
Extended Data Fig. 5 | Single-gate (D ≠ 0) Landau fan diagrams highlighting the robust ν = −N quantum Hall state in N-layer-thick RG. a, Conductivity map 
σ_xx(ν, B) for 7-layer-thick RG (device 7). b, Conductivity map σ_xx(ν, B) for 9-layer-thick RG (device 1). c, σ_xx(ν, B) for 11-layer-thick RG (device 6). 
Extended Data Fig. 6 | Insulating states and hysteretic behaviour in 
multilayer RG. a, b, Additional data for 3.3-nm-thick RG (device 5, same as in 
main text Fig. 4a, b). a, Main panel, temperature dependence of ρ_xx(n) around 
the insulating state, B = 0 T. Inset, Arrhenius plot for the peak resistivity, 
indicating the presence of a bandgap of about 2–3 meV. The y axis shows peak 
resistivity, and red line indicates the fit. b, Main panel, histogram of the 
conductivity values found on the hysteretic curves σ_xx(n), such as those shown 
in the inset. Eighty-three such curves were used to make the histogram, where 
data from forward and backward sweeps are plotted in blue and red, 
respectively. D = 0, T = 250 mK, B = 0. c, Hysteresis (shaded red) in ρ_xx(B) 
observed in device 7 (2.3-nm-thick RG) at the charge neutrality point; 
d, hysteresis (shaded red) in ρ_xx(n) at B = 5 T. For c and d, T = 250 mK. 
e–g, Hysteretic behaviour of device 1 for different cooling cycles. No noticeable 
hysteresis was found for the first cooling event (e). Hysteresis in ρ_xx(n) (f) and 
ρ_xx(D) (g) is clearly seen after another cooling. Solid (dashed) lines indicate 
positive (negative) sweep directions and the coloured areas highlight the 
difference between the sweep directions. T = 20 mK; B = 0 T. 


Extended Data Fig. 7 | Stacking order of device 6 at different fabrication 
stages. a, Optical image of the graphite flake (left) and the corresponding 
Raman map with a step size of 0.8 μm × 0.8 μm (right). Scale bar, 10 μm. 
b, Optical image of the graphite flake encapsulated by hBN (left) and the 
corresponding Raman map with a step size of 0.7 μm × 0.7 μm (right). Scale bar, 
10 μm. c, Optical image of the finished Hall bar (left) and the corresponding 
Raman map with a step size of 0.5 μm × 0.5 μm (right). Scale bar, 4 μm. The 
colour coding of the Raman maps in a–c indicates the ratio of the integrated 
area of the low-frequency component (ranging around 2,635–2,665 cm⁻¹) to 
that of the high-frequency component (ranging around 2,665–2,695 cm⁻¹) of 
the graphite Raman 2D band. The coloured dots in a–c mark the positions 
where the Raman spectra shown in e were taken. d, Optical image showing that 
the Hall bar device is made from the ABC stacked region. Scale bar, 10 μm. 
e, Typical 2D Raman peaks of RG (dots a, c and d) and of graphite of mixed ABA 
and ABC stacking (dot b). Laser excitation wavelength, 633 nm. 
Extended Data Fig. 8 | Temperature dependence of resistivity. a, ρ_xx as a 
function of T at zero gate doping for RG device 4 with a thickness of 16.5 nm 
(blue solid curve), for RG device 5 with a thickness of 3.3 nm (red solid curve), 
and for 6-nm HG (grey dashed line). b, Same data as a but plotted on a log scale. 
While cooling down, ρ_xx of RG first increases for T > T_c, and then decreases, in 
sharp contrast to the monotonic decrease of ρ_xx for HG. The critical 
temperature T_c decreases with increasing thickness of RG. Besides the 
presence of T_c, ρ_xx of the 3.3-nm RG device 5 shows a sharp increase for T < 6 K 
owing to the phase transition to the insulating state. 
Extended Data Fig. 9 | Bandgap opening by displacement field. a, Band 
dispersion of RG under an applied displacement field; bandgap Δ is masked by a 
bandwidth of 2γ_1γ_4/γ_0 such that only the effective gap Δ̃ = Δ − 2γ_1γ_4/γ_0 is visible 
in transport measurements. b, Calculated dependence of Δ̃ and Δ on 
displacement field for N = 9 layers. 
Two-dimensional atomic crystals can radically change their properties in response to 
external influences, such as substrate orientation or strain, forming materials with 
novel electronic structure. An example is the creation of weakly dispersive, ‘flat’ 
bands in bilayer graphene for certain ‘magic’ angles of twist between the orientations 
of the two layers. The quenched kinetic energy in these flat bands promotes 
electron-electron interactions and facilitates the emergence of strongly correlated 
phases, such as superconductivity and correlated insulators. However, the very 
accurate fine-tuning required to obtain the magic angle in twisted-bilayer graphene 
poses challenges to fabrication and scalability. Here we present an alternative route to 
creating flat bands that does not involve fine-tuning. Using scanning tunnelling 
microscopy and spectroscopy, together with numerical simulations, we demonstrate 
that graphene monolayers placed on an atomically flat substrate can be forced to 
undergo a buckling transition, resulting in a periodically modulated 
pseudo-magnetic field, which in turn creates a ‘post-graphene’ material with flat 
electronic bands. When we introduce the Fermi level into these flat bands using 
electrostatic doping, we observe a pseudogap-like depletion in the density of states, 
which signals the emergence of a correlated state. This buckling of 
two-dimensional crystals offers a strategy for creating other superlattice systems and, 
in particular, for exploring interaction phenomena characteristic of flat bands. 


Flat bands facilitate the emergence of strongly correlated electronic 
phases. A celebrated example is the Landau-level (LL) sequence of 
magnetically induced flat bands that can host correlated phases, such 
as fractional quantum-Hall states or magnetically induced Wigner 
crystals. Magnetically induced flat bands, however, have limited 
applicability because the broken time-reversal symmetry precludes 
the emergence of certain correlated states, such as superconductivity. 
More recently, twisted bilayer graphene that is finely tuned to a magic 
angle where a flat band emerges has introduced a new platform for the 
creation of correlated phases. 

Here we explore an alternative path to flat bands that does not 
require fine-tuning or breaking time-reversal symmetry. The strategy 
involves creating flat bands by using the band-structure 
reconstruction that occurs when a two-dimensional (2D) membrane undergoes 
a buckling transition. Buckling transitions in stiff membranes are 
typically triggered by in-plane compressive strain that can be 
generated during thermal cycling, by solvent-induced capillary forces, or by 
substrate-induced stress. On exceeding a critical strain value, buckling 
of the membrane reduces its elastic energy through out-of-plane 
distortions, resulting in intriguing periodic strain patterns (Fig. 1b, c) whose 
structures are dictated by boundary conditions and strain 
distribution. We find that in graphene, buckling-induced strain arrays give 
rise to a periodic pseudo-magnetic field (PMF), which reconstructs 
the low-energy band structure into a series of essentially flat bands. 
Unlike earlier realizations of PMF that were mostly local in nature, 
the buckling transitions studied here produce a global change in the 
electronic structure, which comprises a sequence of flat bands that 
percolate throughout the material. 

We employed the thermally induced buckling transition in graphene
deposited on NbSe₂ or hexagonal boron nitride (hBN) substrates to
create a periodic PMF. The buckling structures studied here are typically
nested between ridges (Extended Data Fig. 1) that often form in
graphene during fabrication. Topographical analysis suggests that
the buckling is triggered by compressive strain generated by the collapse
of ridges during thermal cycling (see Methods section ‘Sample
preparation’). This produces various buckling patterns, from 1D to 2D,
with similar nanometre-scale periods (Fig. 1b, c), suggesting a universal
buckling mechanism that is insensitive to the lattice mismatch between
graphene and its substrate.

In Fig. 1a we show a schematic of the sample and scanning tunnelling
microscopy (STM) measurement set-up. The topography of a
buckling-induced triangular superstructure in graphene deposited on
NbSe₂ is shown in Fig. 1b, right panel. This superlattice consists of alternating
crests and troughs (bright and dark, respectively; Fig. 2a) with

Fig. 1 | Buckled structures in graphene membranes. a, Schematic of sample
geometry and measurement set-up. Top inset, Dirac cone representing the
low-energy band structure of unstrained graphene outside the ridges. Bottom
inset, pseudo-Landau levels (PLLs) renormalize the band structure in the
strained superlattice. b, Left, buckling modes observed by STM topography in
a G/NbSe₂ sample. Right, topography in a different area of the same sample


0.17-nm height modulation and period $a_b$ = 14.4 ± 0.5 nm. The large
lattice mismatch between graphene (0.246 nm) and NbSe₂ (0.36 nm)
rules out a moiré pattern interpretation (Methods). We first focus on
the electronic structure obtained from the dI/dV spectra (I is the current,
V is the bias) in the crest regions (Fig. 2b). The spectra comprise
a sequence of peaks identified as PMF-induced pseudo-Landau levels
(PLLs). The energy of the most pronounced peak (labelled N = 0)
is aligned with the charge neutrality point (CNP) as determined from
the spectra taken in the non-buckled area (Extended Data Fig. 2a). The
CNP energy, $E_D$, is shifted to about 0.5 eV above the Fermi level, owing
to the hole doping induced by the NbSe₂ substrate. Starting from the
N = 0 PLL, which coincides with $E_D$, we label the remaining PLLs in order
of increasing (decreasing for the hole sector, N < 0) energies as N = ±1,
±2, and we find that they follow the sequence expected of LLs in flat
monolayer graphene, $E_N = E_D + \mathrm{sgn}(N)\, v_F \sqrt{2e\hbar|N|B_{\mathrm{PMF}}}$ (here $e$ is the
electron charge and $\hbar = h/(2\pi)$ is the reduced Planck constant). This
PLL sequence confirms the existence of a PMF in the buckled graphene
membrane. Using the Fermi-velocity value of unbuckled graphene,
$v_F = 1.0 \times 10^{6}$ m s⁻¹, we estimate the value of the PMF in the centre of the
crest region as $B_{\mathrm{PMF}}$ = 108 ± 8 T (Fig. 2b, right inset). Outside the buckled
region, where the STM topography is flat, the spectrum exhibits a featureless
‘V’-shape, as expected of unstrained graphene on NbSe₂, with
the minimum at $E_D$ = 0.5 eV (Extended Data Fig. 2).
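
As a rough numerical illustration of the level sequence quoted above, the following sketch evaluates $E_N = E_D + \mathrm{sgn}(N)\, v_F \sqrt{2e\hbar|N|B_{\mathrm{PMF}}}$ for the values given in the text ($v_F = 1.0 \times 10^{6}$ m s⁻¹, $B_{\mathrm{PMF}}$ = 108 T, $E_D$ ≈ 0.5 eV). The function name and the use of SI constants are illustrative choices rather than part of the original analysis.

```python
import math

E_CHARGE = 1.602176634e-19  # elementary charge, C
HBAR = 1.054571817e-34      # reduced Planck constant, J s


def pll_energy_ev(n, b_pmf_tesla, e_dirac_ev=0.0, v_fermi=1.0e6):
    """Energy (eV) of level N: E_D + sgn(N) * v_F * sqrt(2 e hbar |N| B)."""
    if n == 0:
        return e_dirac_ev
    sign = 1.0 if n > 0 else -1.0
    # The square root is evaluated in SI units (joules), then converted to eV.
    energy_joule = v_fermi * math.sqrt(
        2.0 * E_CHARGE * HBAR * abs(n) * b_pmf_tesla
    )
    return e_dirac_ev + sign * energy_joule / E_CHARGE


if __name__ == "__main__":
    # Values quoted in the text: B_PMF = 108 T, E_D about 0.5 eV.
    for n in range(-2, 3):
        print(n, round(pll_energy_ev(n, 108.0, e_dirac_ev=0.5), 3))
```

With these inputs the N = ±1 levels sit roughly 0.38 eV from the N = 0 level, and the spacing to higher levels shrinks as $\sqrt{|N|}$.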

Another hallmark of a PMF is the sublattice polarization of the electronic
wavefunction in the N = 0 level. This means that for a given PMF
sign, the N = 0 electronic wavefunction is localized on one sublattice,
while for the opposite sign PMF it resides on the other sublattice.
The PMF-induced sublattice polarization, which is the counterpart
of the magnetically induced valley polarization of the N = 0 level, follows
directly from the opposite signs of the PMF in the K and K′ valleys.
The observation of sublattice polarization in the atomic-resolution
STM images shown in Fig. 2d–f (see below) thus provides a direct
experimental signature of the extant PMF in buckled graphene
superstructures.

shows a triangular buckling pattern ($V_b$ = 0.5 V, I = 40 pA). Inset, height profile
along the arrow shows the approximately 0.17-nm height modulation of the
buckling pattern with lateral period 14.4 ± 0.5 nm. c, 1D and 2D buckling modes
observed by STM topography in different areas of the same G/hBN sample
($V_b$ = −300 mV, I = 20 pA). Grey dashed lines are guides to the eye, marking the
periodic structure.


In both crests (Fig. 2e) and troughs (Fig. 2f), sublattice polarization
is clearly revealed by the triangular structure, which reflects the fact
that only atoms in one sublattice, say A, are visible in the crests while
only atoms in the B sublattice are visible in the troughs. This in turn
confirms that the sign of the PMF in the crests is opposite to that in
the troughs. In the transition regions between crests and the troughs
(Fig. 2d), we observe a honeycomb pattern where both A and B sublattices
are visible, indicating a zero-crossing of the PMF. This PMF sublattice
polarization is closely reproduced in the numerical simulations
(Fig. 3) discussed below.

Turning to the dI/dV spectra in the trough regions (Fig. 2c), we note
that they also exhibit a sequence of peaks. Plotting the energy of these
peaks versus the level index we find that the sequence is essentially linear
in N, with a roughly equidistant energy spacing of about 89 ± 2 meV.

To better understand these results, we performed tight-binding calculations
in the presence of a periodic PMF with a triangular structure
similar to that in the experiment (Fig. 3d):

$$B_{\mathrm{PMF}}(x, y) = B\,[\cos(\mathbf{b}_1 \cdot \mathbf{r}) + \cos(\mathbf{b}_2 \cdot \mathbf{r}) + \cos(\mathbf{b}_3 \cdot \mathbf{r})] \qquad (1)$$

Here $B$ is the PMF amplitude, $\mathbf{b}_1 = \frac{2\pi}{a_b}\left(1, -\frac{1}{\sqrt{3}}\right)$, $\mathbf{b}_2 = \frac{2\pi}{a_b}\left(0, \frac{2}{\sqrt{3}}\right)$,
$\mathbf{b}_3 = \mathbf{b}_1 + \mathbf{b}_2$, and $a_b$ is the superlattice period. This choice of coordinates
reflects the C6 symmetry of the PMF superlattice and the fact
that the total PMF flux has to vanish, because time-reversal symmetry
is not broken. Note that because $B_{\mathrm{PMF}}$ is defined as a sum of three
cosines of amplitude $B$, its maximum value, attained on the crests,
equals $3B$. Its minimum value, attained in the percolating network of
troughs surrounding the crests, is $-1.5B$. The zeros of this PMF configuration
form circles surrounding each crest. Because of the spatial
PMF variation, the measured spectra correspond to an effective PMF
value, $B_{\mathrm{eff}}$, which approximately averages the field over the
pseudo-cyclotron orbit. This averaging effect becomes more pronounced
as the ratio between the magnetic length and the lattice
period decreases (Extended Data Fig. 6). We found that the PLL

Fig. 2 | Pseudo-Landau level quantization and sublattice polarization in
buckled graphene. a, STM topography (70 nm × 70 nm) of a buckled graphene
membrane supported on NbSe₂ reveals a 2D triangular lattice ($V_b$ = 0.6 V,
I = 20 pA). b, dI/dV spectrum taken in the crest region labelled by the red square
in the left inset. The black down-arrows label the LL index, N = 0, ±1, ±2, … Right
inset, pseudo-Landau level (PLL) energy E plotted against the square root of the
LL index, N = 0, ±1, ±2, … c, Same as b taken in a trough region (blue square in the
left inset of b). The negative second derivative of the dI/dV signal (black curve)
is superposed to better reveal the peak energies (black arrows). Inset, linear


sequence obtained for $B_{\mathrm{eff}}$ = 112 T ($B$ = 120 T) matches the experimentally
measured sequence shown in Fig. 2b.
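
The stated properties of the field profile in equation (1) (maximum $3B$ on the crests, minimum $-1.5B$ in the troughs, vanishing net flux) can be checked numerically. The sketch below is a minimal illustration that assumes NumPy is available and takes the reciprocal vectors as $\mathbf{b}_1 = (2\pi/a_b)(1, -1/\sqrt{3})$, $\mathbf{b}_2 = (2\pi/a_b)(0, 2/\sqrt{3})$ and $\mathbf{b}_3 = \mathbf{b}_1 + \mathbf{b}_2$. The function names, the grid size and the real-space lattice vectors used for sampling are illustrative choices, not taken from the original work.

```python
import numpy as np


def pmf_profile(x, y, amplitude, period):
    """Evaluate equation (1): B * [cos(b1.r) + cos(b2.r) + cos(b3.r)]."""
    k = 2.0 * np.pi / period
    b1 = k * np.array([1.0, -1.0 / np.sqrt(3.0)])
    b2 = k * np.array([0.0, 2.0 / np.sqrt(3.0)])
    b3 = b1 + b2
    phases = [b[0] * x + b[1] * y for b in (b1, b2, b3)]
    return amplitude * sum(np.cos(p) for p in phases)


def sample_unit_cell(amplitude, period, n=300):
    """Sample the field on an n x n grid covering one superlattice unit cell."""
    # Real-space lattice vectors dual to b1 and b2 (a_i . b_j = 2*pi*delta_ij).
    a1 = period * np.array([1.0, 0.0])
    a2 = period * np.array([0.5, np.sqrt(3.0) / 2.0])
    frac = np.arange(n) / n
    s, t = np.meshgrid(frac, frac, indexing="ij")
    x = s * a1[0] + t * a2[0]
    y = s * a1[1] + t * a2[1]
    return pmf_profile(x, y, amplitude, period)


if __name__ == "__main__":
    B, a_b = 120.0, 14.0  # tesla and nm, the values quoted in the text
    field = sample_unit_cell(B, a_b)
    print(f"max  = {field.max():.1f} T")
    print(f"min  = {field.min():.1f} T")
    print(f"mean = {field.mean():.2e} T")
```

Sampling exactly one unit cell on a uniform grid makes the mean a direct proxy for the net flux per cell, which should vanish to numerical precision.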

Figure 3a–c plots the evolution of the calculated local density of
states (LDOS) with PMF amplitude, for each sublattice in the crest,
transition and trough regions. In the crest and trough regions, we
observe a strong imbalance in low-energy LDOS intensity between
the A and B sublattices (Fig. 3a, c), which is absent in the intermediate
transition region (Fig. 3b). This is consistent with the experimentally
observed sublattice polarization in the crest (Fig. 2e) and
trough regions (Fig. 2f), and with its absence in the transition region
(Fig. 2d). In Fig. 3e we show the simulated LDOS spectrum in the
crest region for $B_{\mathrm{eff}}$ = 112 T, together with a fit to the square-root
dependence on N, which is consistent with the experimentally measured
spectra shown in Fig. 2b as discussed in Extended Data Fig. 6.
The simulated LDOS spectrum in the trough region (Fig. 3f) is
approximately linear in N with a level spacing of about 90 meV, consistent
with the experimental results in Fig. 2c. To elucidate the
origin of this linear peak sequence, we show in Fig. 3g the evolution
of the LDOS calculated along a path connecting two crests (arrow
in Fig. 3d, lower panel). The experimental peak positions in the crest
(trough) regions are shown by right-pointing (left-pointing) arrows,
respectively. In the centre of the trough the equidistant level
sequence is clearly seen. We note that these levels are not solely
determined by the local value of the PMF and they do not exhibit
spatial dispersion. Furthermore, although these states spread into
the crest regions, they disappear on approaching the PLLs in the
crest centre. The discrete nature of these equidistant levels indicates
that they originate from strain-induced confinement within the
quantum well defined by the PMF, closely resembling magnetic
confinement in quantum dots in 2D semiconductors. As in the

dependence of the peak energy on the level index, N. d–f, Atomic-resolution
STM topography in the transition, crest and trough regions (shown
respectively in the left inset of b as orange rectangle, red square and blue
square). The colour scale for d and e is shown in f. A schematic honeycomb
lattice, with the yellow and green balls representing the two sublattices, A and
B, is superposed to highlight the sublattice polarization in the different
regions. The dashed-line triangles indicate opposite orientations of the lattice
polarization in each region. a.u., arbitrary units.


case of quantum dots, here the electrons are trapped in a
PMF-induced potential well, which produces a set of levels spaced
by a characteristic (geometry-dependent) energy scale $\Delta E = \hbar v_F \pi / W$,
where $W$ is the dot size. Using the energy scale of the levels in the
troughs, approximately 90 meV, we estimate $W$ = 21 nm, which is
approximately the size of the well indicated by the grey dashed lines
in Fig. 3g, lower panel. In the troughs, the energy of each level
decreases with increasing PMF (Fig. 3c) until all levels merge into
one degenerate level that approaches the CNP when the magnetic
length becomes considerably smaller than the dot size.

We next discuss the emergence of flat bands in this system. The
periodic potential imposed by the PMF superlattice breaks up the
low-energy conical band of graphene into a series of minibands
whose width is controlled by the strength of the PMF amplitude, $B$.
At low values of $B$, the minibands restructure the LDOS into a series of
semi-discrete levels, as illustrated in Extended Data Fig. 7. As $B$ increases,
these levels evolve into increasingly narrower bands that become flat
in the limit of large $B$. In Fig. 4a, we plot the first few minibands in the
buckled graphene G/NbSe₂ sample for $B_{\mathrm{eff}}$ = 112 T. They all show flat-band
segments along the K′–M′ line in the mini-Brillouin zone. The corresponding
simulated LDOS contours in the three minibands with energies
$E - E_D$ = −0.03 eV, −0.17 eV and −0.28 eV, shown in
Fig. 4b, show good agreement with the measured dI/dV maps in Fig. 4c,
indicating that the model captures the salient features of the data. This
sequence of buckling-induced flat bands that are well separated from
each other would be ideally suited for hosting correlated electronic
states, if it were possible to align the Fermi energy within one of the flat
bands. However, this was not possible in the buckled G/NbSe₂ sample,
because of the finite conductance of the NbSe₂ substrate. Nevertheless,
our observations indicate that this could be achieved by creating

Fig. 3 | Simulated LDOS in buckled graphene. a–c, Simulated evolution of the
LDOS with energy (E) and PMF amplitude (B as defined in equation (1)) in the
crest (a), transition (b) and trough (c) regions. Upper (lower) panels represent
the A (B) sublattice. The colour scale represents the LDOS intensity. d, Upper
panel, STM topography ($V_b$ = 500 mV, I = 20 pA) showing the triangular
superlattice with alternating crest (red) and trough (blue) regions. Lower
panel, PMF configuration used in the simulation. The colour scale represents
the PMF amplitude B (see equation (1)). The symbols indicate the positions of
the calculated LDOS in a–c, and the green line indicates the calculated path for
g. e, f, Upper panels, calculated LDOS as a function of energy E in the crest and
trough regions for $B_{\mathrm{eff}}$ = 112 T ($B$ = 120 T). Lower panels, level-index dependence


a buckling pattern in graphene deposited on an insulating substrate, 
as shown below. 

In order to study correlation effects in buckling-induced flat bands,
we turn to the G/hBN sample. In this case, applying a gate voltage across
the insulating hBN layer allows us to bring the Fermi level within the flat
band. Using the same sample preparation process as for the G/NbSe₂
sample, we again obtain buckling superlattices as illustrated in Fig. 5a, b.

The gate-voltage dependence of the dI/dV spectra of the G/hBN
sample in the transition region where the strain is minimal (Fig. 5c)
shows that for $V_g$ = 0 the spectrum is ‘V’-shaped and that its minimum,
which marks the CNP, is aligned with the Fermi level. This
indicates that, unlike the case of G/NbSe₂, the G/hBN sample is not
doped by the hBN substrate. Tuning the gate voltage from −62 V
to +38 V gradually moves the CNP from the hole-doped (about
−100 mV) to the electron-doped sector (about +200 mV) in Fig. 5c.
In Fig. 5d we compare the dI/dV spectra in the crest and transition
regions for the heavily hole-doped ($V_g$ = −62 V) case. Here, the crest
spectrum features a prominent peak at the CNP that is flanked by
a sequence of weaker peaks (blue arrows), closely resembling the
crest spectrum in the G/NbSe₂ sample (Fig. 2b). As before, we identify
the prominent peak with the strain-induced N = 0 PLL, and fitting
the peak sequence with a square-root level-index dependence, we

of the simulated (black squares) and experimental (red dots) peak energy
levels, measured relative to the charge neutrality point, $E - E_D$. The red line
represents a fit to the experimental data. $E_D$ = 0.43 eV is taken from the
experimental data on an unbuckled region of the sample. g, Upper panel,
contour plot of the LDOS spectra (sublattice averaged) as a function of energy
(E) and position (x) connecting two crests along the green line in d. Green
arrows indicate the positions of the peaks in the measured spectra shown in
Fig. 2b (right-pointing arrows) and Fig. 2c (left-pointing arrows). The colour
scale represents the LDOS intensity. Lower panel, evolution of the simulated
value of $B_{\mathrm{PMF}}/B$ along the green path in d.


obtain the PMF value $B_{\mathrm{PMF}}$ ≈ 104 T (Fig. 5d inset) consistent with the
simulations described in Extended Data Fig. 9. Changing the back-gate
voltage from the hole-doped ($V_g$ = −62 V) to the electron-doped
($V_g$ = +38 V) sector (Fig. 5e) we note that the N = 0 PLL tracks the
CNP. As before, the N = 0 PLL corresponds to a buckling-induced
weakly dispersive flat band. When doping the sample to partially
fill this band, a pseudo-gap feature at the Fermi level splits the peak
in two, indicating the appearance of a correlation-induced state.
Labelling the peaks above and below the Fermi level as upper band
and lower band, respectively, we find that when the Fermi level is
aligned with the CNP, the upper- and lower-band peaks have equal
intensities. Doping away from charge neutrality, we observe a pronounced
spectral weight redistribution between the two peaks, so
that in the electron-doped regime ($V_g$ > 0) the intensity of the lower
band dominates that of the upper band, whereas in the hole-doped
regime ($V_g$ < 0) the upper-band intensity becomes dominant. The
appearance of the pseudo-gap feature and the spectral weight
redistribution in the partially filled flat band in the crest regions
of buckled G/hBN is strikingly similar to that observed in the partially
filled flat band of magic-angle twisted bilayer graphene,
where correlation-induced insulating and superconducting states
have been observed.


Fig. 4 | Flat bands and LDOS maps. a, Calculated band structure for a buckled
graphene superlattice with period $a_b$ = 14 nm and PMF amplitude $B_{\mathrm{eff}}$ = 112 T
($B$ = 120 T). Inset, superlattice mini-Brillouin zone, nested within the original
Brillouin zone of flat graphene, together with the trajectory along which the


These findings demonstrate that buckling-induced periodic strain
patterns offer a new experimental strategy for the creation of flat bands
and for inducing correlated states with exceptional flexibility. The shape,
period and symmetry of the buckled structures can be controlled by
experimentally adjustable parameters, such as boundary geometry and

band structure is calculated. b, c, Calculated LDOS contours (‘Theory’, b) and
measured dI/dV maps (‘Experiment’, c) at $B_{\mathrm{eff}}$ = 112 T for the three energies
relative to $E_D$ that correspond to the flat-band regions in a, as indicated
by the colour-coded boxes.


strain distribution, enabling the realization of flat bands with prescribed
geometry. We believe that the described way of buckling 2D crystals
will be used widely to create other superlattice systems with controllable
electronic band structure, thereby enabling the exploration of strong
interaction effects and the emergence of correlated phases.

Fig. 5 | Flat bands in buckled G/hBN. a, STM topography of the buckled G/hBN
sample ($V_b$ = −300 mV, I = 20 pA). b, Height profile along the black line in a
shows the approximately 3.5-Å height modulation of the buckling pattern.
The horizontal dashed lines demarcate the extent of the height modulation.
c, Gate-voltage dependence of the spectra in the transition region shows the
shift of the CNP (arrows) with doping ($V_b$ = −400 mV, I = 20 pA). The black dashed
line labels the Fermi level ($E_F$). d, dI/dV spectra from the crest (blue trace) and
transition (black trace) regions labelled respectively by the star and the circle
in a ($V_b$ = −400 mV, I = 20 pA). Inset, PLL energy in the crest area plotted against
the square root of the LL index, N = 0, ±1, ±2, … e, Gate-voltage ($V_g$) dependence
of the spectra in the crest region shows the pseudo-gap feature and spectral
weight redistribution in the partially filled flat band ($V_b$ = −400 mV, I = 20 pA).

Methods


Sample preparation 

To avoid oxygen and moisture contamination, the heterostructures
are fabricated in a dry Ar atmosphere in a glovebox. Graphene is
first exfoliated on a poly(methyl methacrylate) (PMMA) film and
then transferred onto a thin NbSe₂ flake deposited on a SiO₂/Si substrate.
Instead of dissolving the PMMA in acetone, we mechanically
peeled it off. The Au electrodes were deposited by standard SEM
lithography. Before loading the sample into the STM chamber, the
sample is annealed at 230 °C in forming gas (10% H₂ and 90% Ar)
overnight to remove the PMMA residue. The STM experiments are
performed in a home-built STM at 4.6 K with a cut Pt–Ir tip,
and the sample is located using a capacitive-based self-navigation
technique. The tip used here is calibrated on the Au electrode, and
the dI/dV curves are measured using a standard lock-in technique
with a small a.c. voltage modulation (2 mV at 473.1 Hz) added to
the d.c. junction bias.


Buckling pattern formation 

Periodic pattern formation following a buckling transition is largely
determined by the boundary conditions and stress distribution. In
order to understand the buckling pattern in the G/NbSe₂ sample, we
carried out large-area topography measurements that include the
boundaries of the pattern (Extended Data Fig. 1). In Extended Data
Fig. 1a, the triangular pattern is delimited by two intersecting ridges
(labelled as ridge 1 and ridge 2, ~0.5 μm long and 4 nm tall) that form a
60° fan. Zooming into the fan area (Extended Data Fig. 1c), we discern
the buckling pattern. In Extended Data Fig. 1c, d, we show that the period
of the pattern increases monotonically with the distance from the apex
where the two ridges meet.

When graphene is deposited on a substrate, folds, ridges and bubbles
due to trapped gas or solvents are produced. Many of these
defects disappear on annealing (see Methods section ‘Sample preparation’),
but not all, presumably because their geometry is such
that it does not allow the trapped species to escape or because the
defect is pinned to the substrate. We observe that the ridges that
survive the annealing step (Extended Data Fig. 1a) do not show the
usual concave profile seen before annealing, but rather show evidence
of collapse, probably due to freezing of the trapped gas that
supported the ridge. Following the collapse, the ridge profile
becomes convex and is flanked by two tall lips on the boundaries of
the original fold (Extended Data Fig. 1e). The lip pointing inwards
towards the fan area is consistently shorter than the outside lip, suggesting
that some of the graphene membrane comprising the original
fold was pushed inwards. This increases the area of the membrane
trapped between the two ridges (ridges 1 and 2 in Extended Data
Fig. 1a), resulting in biaxial compressive strain which can trigger the
buckling transition. To test this scenario, we conjecture that the
concave region of the ridge was originally part of the convex top.
This suggests that one can reconstruct the original shape by a mirror
transformation of the concave part relative to the green dashed line
in Extended Data Fig. 1e (the line intersects the tallest point of the
convex part and is parallel to the substrate). This produces the reconstructed
dome shown by the red symbols in Extended Data Fig. 1e.
Using this procedure immediately reveals a missing part of the
original dome of length $\Delta L$ as indicated in the figure. The strain produced
by this increased length is estimated from the ratio between $\Delta L$
and the bisector of the 60° triangle: $\varepsilon = \Delta L/(L \sin 30°) = 2\Delta L/L$, where
$L$ is the distance from the apex formed by the intersection of the two
ridges (Extended Data Fig. 1f).

Theoretical models and simulations of wrinkling based on minimizing
the energy of a stretched or compressed membrane by allowing
out-of-plane distortions have shown that there are simple scaling laws
relating the period of the buckled membrane, $\lambda$, to the strain, $\varepsilon$:

$$\frac{\lambda^4}{(tL)^2} = \frac{4\pi^2 \nu}{3(1-\nu^2)\,\varepsilon}$$

Using $t$ = 0.3 nm for the graphene thickness, $\nu$ = 0.15 for its Poisson ratio
and the expression for the strain as a function of distance $L$, we find

$$\lambda = \left[\frac{4\pi^2 \nu t^2}{3(1-\nu^2)\, 2\Delta L}\right]^{1/4} L^{3/4} = 0.14\, L^{3/4}$$

Fitting the data for the distance dependence of the strain, shown in
Extended Data Fig. 1d, to the expression $\lambda = \lambda_0 + c L^{3/4}$, we obtain the
constant $c$ = 0.154 ± 0.005, consistent with the above estimate. The
value of the offset $\lambda_0$ = (6 ± 0.3) nm suggests that this formula breaks
down at short distances.
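
To get a feel for the fitted scaling, the sketch below evaluates $\lambda = \lambda_0 + c L^{3/4}$ with the quoted fit parameters and inverts it for the distance at which a given period occurs. It assumes that both $\lambda$ and $L$ are expressed in nanometres, which the text does not state explicitly. The function names are illustrative.

```python
def buckling_period_nm(distance_nm, offset_nm=6.0, c=0.154):
    """Fitted period lambda = lambda_0 + c * L**(3/4); lengths assumed in nm."""
    return offset_nm + c * distance_nm ** 0.75


def distance_for_period_nm(period_nm, offset_nm=6.0, c=0.154):
    """Invert the fit: distance L from the apex at which a given period occurs."""
    if period_nm <= offset_nm:
        raise ValueError("period must exceed the fitted offset lambda_0")
    return ((period_nm - offset_nm) / c) ** (4.0 / 3.0)


if __name__ == "__main__":
    # Period reported for the triangular superlattice in the main text.
    L = distance_for_period_nm(14.4)
    print(f"L = {L:.0f} nm for a 14.4 nm period")
```

Under the stated unit assumption, a 14.4-nm period corresponds to a distance of roughly 0.2 μm from the apex, which lies within the ~0.5 μm length of the ridges.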

To understand why the 1D scaling of the buckling period is consistent
with our results, we consider the sketch shown in Extended Data Fig. 1b.
We note that it is unlikely that both ridges collapse at the same instant.
Now if we suppose that the ridge marked by the green line in the sketch
collapses first, it will produce a set of roughly parallel wrinkles whose
spacing increases with distance as the strain decreases according to the
scaling formula above. When subsequently the yellow ridge collapses,
it produces a similar set of wrinkles roughly parallel to itself. The points
of intersection of the two wrinkle sets will thus produce a triangular
pattern of crests (black dots) while the areas in between will be troughs.

Buckling transitions in thin stiff membranes have been studied
extensively both experimentally and theoretically in the context of
mechanical engineering. The details of the patterns that emerge after
the buckling transition has taken place are controlled by boundary
geometry and the strain distribution. Depending on the value of
these parameters, a variety of buckling patterns are observed, including
square, hexagonal, herringbone and stripes.


Flat areas in G/NbSe₂ and G/hBN samples
Superposing two 2D crystal structures produces a moiré pattern whose
period is controlled by the two atomic lattice constants, $a$ and $b$, and
by the angle between their crystal orientations. The largest moiré
period, which is obtained when the two crystals are aligned, is given
by $\lambda_{\max} = (1 + 1/\delta)a$, where $\delta = (b - a)/a$ is the lattice mismatch. The lattice
constants for graphene and NbSe₂ are $a$ = 0.246 nm and $b$ = 0.36 nm,
respectively, leading to $\lambda_{\max}$ = 0.77 nm. This is more than an order of
magnitude smaller than the superlattice periods observed in our work,
immediately ruling out an interpretation in terms of a moiré pattern.
To further confirm that the observed pattern is not due to a moiré
structure, we show in Extended Data Fig. 2 the atomically resolved
topography of a region far from the two ridges, which shows the honeycomb
structure characteristic of flat graphene (Extended Data Fig. 2b).
In this region, the featureless STM topography (Extended Data Fig. 2a)
together with the ‘V’-shaped dI/dV spectrum confirm that the graphene
is well decoupled from the bottom NbSe₂. This procedure of taking
spectra in an unbuckled region of the sample also served for tip selection
in all our measurements. When the spectra outside the buckled
region showed an anomalous feature such as a gap or large dip at the
Fermi energy, the tip was reconditioned or replaced. Tip integrity is
particularly important for distinguishing between correlation-induced
gap features and artefacts introduced by the tip.
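
The moiré estimate above is simple enough to reproduce directly. The following sketch evaluates $\lambda_{\max} = (1 + 1/\delta)a$ with $\delta = (b - a)/a$ for the quoted lattice constants and compares it with the observed 14.4-nm superlattice period. The function name is an illustrative choice.

```python
def max_moire_period(a, b):
    """Largest moire period for aligned lattices with constants a < b.

    Implements lambda_max = (1 + 1/delta) * a with delta = (b - a) / a.
    The result has the same length unit as the inputs.
    """
    if b <= a:
        raise ValueError("expected b > a")
    mismatch = (b - a) / a
    return (1.0 + 1.0 / mismatch) * a


if __name__ == "__main__":
    a_graphene, b_nbse2 = 0.246, 0.36  # nm, values quoted in the text
    lam = max_moire_period(a_graphene, b_nbse2)
    print(f"lambda_max = {lam:.3f} nm")
    print(f"observed period / lambda_max = {14.4 / lam:.1f}")
```

The result is a little under 0.8 nm, in line with the value quoted above and well over an order of magnitude below the observed buckling period.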


Buckling patterns and dI/dV maps
The two panels of Extended Data Fig. 3a show the STM topography of
the buckled graphene in the G/NbSe₂ sample taken with two different
bias voltages. The blue lines are guides to the eye, connecting the crests.
The crests remain bright for different bias voltages, consistent with
their higher topography.

In Extended Data Fig. 3b, we show the dI/dV map at an energy corresponding
to the N = 0 PLL in the crest area of the G/NbSe₂ sample.
The uniform LDOS represented by this map differs from the petal structure
expected for a Gaussian bump and reflects the unique geometry
of the PMF induced by the buckled structure, which further confirms
the theoretical model.


Transition area in the G/NbSe₂ sample

Extended Data Fig. 4a shows the theoretical contour plot of the
LDOS spectra connecting two crest areas (see also the upper panel
of Fig. 3g). In the transition area labelled by the yellow dashed line,
we note that features from the crest (green arrows) and trough (red
arrows) coexist. This coexistence is also observed in the experimental
dI/dV spectrum (Extended Data Fig. 4b), which exhibits peaks that
can be traced back to both the crest (green arrows) and the trough
(red arrows) regions.


PMF dependence on superlattice period 

Extended Data Fig. 5 shows the simulated LDOS in the crest region as
a function of PMF amplitude for several values of the superlattice
period, $a_b$, as marked. The dashed lines represent the field dependence
of the LL energy in uniform fields. We note that as the lattice spacing
increases, the spectra start approaching the unstrained LL sequence
at lower values of the PMF. This is consistent with the fact that LLs,
which correspond to cyclotron motion, can only form if the magnetic
field $B$ is fairly constant over length scales that are several times the
magnetic length, $l_B = \sqrt{\hbar/(eB)}$.
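
For orientation, the magnetic length can be compared with the superlattice period for the field values discussed in this work. The sketch below assumes the standard definition $l_B = \sqrt{\hbar/(eB)}$. The function name and the printed comparison are illustrative.

```python
import math

E_CHARGE = 1.602176634e-19  # elementary charge, C
HBAR = 1.054571817e-34      # reduced Planck constant, J s


def magnetic_length_nm(b_tesla):
    """Magnetic length l_B = sqrt(hbar / (e B)), returned in nanometres."""
    return math.sqrt(HBAR / (E_CHARGE * b_tesla)) * 1e9


if __name__ == "__main__":
    period_nm = 14.0  # superlattice period used in the simulations
    for b in (20.0, 60.0, 112.0):
        l_b = magnetic_length_nm(b)
        print(f"B = {b:5.1f} T: l_B = {l_b:.2f} nm, a_b / l_B = {period_nm / l_b:.1f}")
```

At 112 T the magnetic length is about 2.4 nm, so a 14-nm period spans several magnetic lengths. At lower fields the ratio drops, which is the regime where the LL picture is expected to fail.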


Effective PMF ($B_{\mathrm{eff}}$) in the crest area

Extended Data Fig. 6a shows LDOS cuts in the crest region from
Fig. 3a (A sublattice) for several values of $B$, as marked. For each
spectrum we calculated the value of the effective magnetic field,
$B_{\mathrm{eff}}$, from the energy of the first peak, $E_1 = v_F\sqrt{2e\hbar B_{\mathrm{eff}}}$. In order to check
if the peaks can be interpreted as LLs, we mark by arrows the calculated
peak sequence for the corresponding $B_{\mathrm{eff}}$, $E_N = \pm v_F\sqrt{2e\hbar B_{\mathrm{eff}} N}$.
We note that the arrows coincide with the spectral peaks for sufficiently
large PMF amplitudes, suggesting that the LL language is
appropriate for these spectra, as long as $B_{\mathrm{eff}}$ is used. However,
the extracted effective field is much smaller than the maximum PMF
value, that is, $B_{\max} = 3B$, reflecting the fact that the cyclotron orbit
averages the field over an area of size $\sqrt{2N+1}\, l_B$. For example, for a
14-nm lattice period and amplitude $B$ = 120 T, the peak sequence
gives $B_{\mathrm{eff}}$ = 112 T, which is consistent with the experimental result in
Fig. 2b.
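
The extraction of $B_{\mathrm{eff}}$ from the first peak amounts to inverting $E_1 = v_F\sqrt{2e\hbar B_{\mathrm{eff}}}$. The sketch below does this and also evaluates the orbit size $\sqrt{2N+1}\, l_B$ over which the field is averaged. The first-peak energy of 0.384 eV used in the example is a hypothetical input chosen for illustration, not a value read from the data, and the function names are likewise illustrative.

```python
import math

E_CHARGE = 1.602176634e-19  # elementary charge, C
HBAR = 1.054571817e-34      # reduced Planck constant, J s


def effective_pmf_tesla(first_peak_ev, v_fermi=1.0e6):
    """Invert E_1 = v_F * sqrt(2 e hbar B_eff) for B_eff (tesla)."""
    e1_joule = first_peak_ev * E_CHARGE
    return e1_joule ** 2 / (2.0 * E_CHARGE * HBAR * v_fermi ** 2)


def orbit_size_nm(n, b_tesla):
    """Cyclotron orbit size sqrt(2|N| + 1) * l_B in nanometres."""
    l_b = math.sqrt(HBAR / (E_CHARGE * b_tesla)) * 1e9
    return math.sqrt(2 * abs(n) + 1) * l_b


if __name__ == "__main__":
    e1 = 0.384  # eV, hypothetical first-peak energy used only for illustration
    b_eff = effective_pmf_tesla(e1)
    print(f"B_eff = {b_eff:.0f} T")
    for n in range(3):
        print(f"N = {n}: orbit size = {orbit_size_nm(n, b_eff):.2f} nm")
```

Because the orbit size grows with $N$, higher levels sample a larger part of the spatially varying field, which is one way to see why a single effective value describes the sequence only approximately.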

As shown in Fig. 3a and Extended Data Fig. 5, the N = 0 PLL undergoes
a transition from a double peak at low PMF to a single one at
a higher PMF; this transition occurs at a critical PMF value that is
highly sensitive to the lattice spacing. This is clearly illustrated in
Extended Data Fig. 6b, where we show that the simulated N = 0 double
peak for $B_{\mathrm{eff}}$ = 112 T and $a_b$ = 14 nm merges into a single peak for
$a_b$ = 14.8 nm. The data in Fig. 2, where $a_b$ = 14.4 ± 0.5 nm, $B_{\mathrm{eff}}$ = 112 T
and the N = 0 PLL peak is unsplit, are consistent with the simulated
spectrum (Extended Data Fig. 6b, red curve) in the regime above
the critical PMF.


Flat-band structure in buckled G/NbSe₂

In Extended Data Fig. 7 we plot the band structure and LDOS in the
trough region for $B$ = 140 T (Extended Data Fig. 7a, b) and $B$ = 180 T
(Extended Data Fig. 7c, d). The figure shows the flattening of the bands
with increasing field amplitude.


Tight-binding model for the strained triangular lattice 
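The effect of strain is included in the tight-binding Hamiltonian through
the modulation of the hopping energy. This is given by

$$t_{ij} = t_0\, e^{-\beta (d_{ij}/a_{cc} - 1)} \qquad (2)$$

where $\beta$ is the decay coefficient, $a_{cc}$ is the unstrained carbon–carbon
bond length, and $d_{ij} = |\mathbf{d}_{ij}|$ is the strained bond length defined by the
strain tensor $\boldsymbol{\varepsilon}$ as

$$\mathbf{d}_{ij} = (\mathbf{I} + \boldsymbol{\varepsilon})\, \boldsymbol{\delta}_{ij} \qquad (3)$$

where $\boldsymbol{\delta}_{ij}$ is the vector in the direction of the bond between atoms $i$ and
$j$, and $\mathbf{I}$ is the identity matrix. Changes in the hopping energies generate
a strain-induced vector potential in the system, which in the case of
hexagonal lattices is given by

$$A_x - iA_y = -\frac{1}{e v_F} \sum_j \delta t_{ij}\, e^{i\mathbf{K}\cdot\mathbf{r}_{ij}} \qquad (4)$$

The corresponding PMF is calculated using

$$\mathbf{B} = \nabla \times \mathbf{A} = (\partial_x A_y - \partial_y A_x)\, \mathbf{e}_z \qquad (5)$$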

Owing to gauge invariance, we may choose A_y = 0. Hence, our vector
potential is then given by A_x = ∫B(x, y) dy, where B(x, y), shown in
Extended Data Fig. 8a, is given by equation (1). This leads to

$$A_x = B\left[\frac{1}{b_{1y}}\sin(\mathbf{b}_1\cdot\mathbf{r}) + \frac{1}{b_{2y}}\sin(\mathbf{b}_2\cdot\mathbf{r}) + \frac{1}{b_{3y}}\sin(\mathbf{b}_3\cdot\mathbf{r})\right]. \tag{6}$$
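
The relation between the field profile and the gauge choice A_y = 0 can be checked numerically. The sketch below assumes that equation (1) is a sum of three cosines of equal amplitude, and it assumes a particular set of reciprocal vectors b_1, b_2, b_3 = b_1 + b_2 for the triangular superlattice; neither the vectors nor the function names are taken from the text. Lengths are in nm and fields in T, so A_x is in T nm.

```python
import numpy as np


def reciprocal_vectors(a_b):
    """Assumed reciprocal vectors of a triangular superlattice with period a_b (nm)."""
    k = 2 * np.pi / a_b
    b1 = k * np.array([1.0, -1.0 / np.sqrt(3)])
    b2 = k * np.array([0.0, 2.0 / np.sqrt(3)])
    return b1, b2, b1 + b2


def pmf(x, y, amp, a_b):
    """Field profile B(x, y): sum of three cosines with amplitude amp."""
    return amp * sum(np.cos(b[0] * x + b[1] * y) for b in reciprocal_vectors(a_b))


def vector_potential_x(x, y, amp, a_b):
    """A_x obtained by integrating B(x, y) over y term by term (A_y = 0 gauge)."""
    return amp * sum(np.sin(b[0] * x + b[1] * y) / b[1] for b in reciprocal_vectors(a_b))
```

Differentiating `vector_potential_x` with respect to y returns `pmf`, which is the statement A_x = ∫B dy; every assumed b vector has a non-zero y component, so no term diverges.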

Substituting t_ij = t_0(1 + δt_ij) and expanding equation (4) up to first
order, we obtain the following expression for the vector potential®

$$(A_x, A_y) = -\frac{1}{2 e v_F}\left(2\delta t_1 - \delta t_2 - \delta t_3,\ \sqrt{3}\,(\delta t_2 - \delta t_3)\right), \tag{7}$$

where δt_1, δt_2 and δt_3 are the strain modulations of hopping energies
along the directions of graphene’s nearest neighbours δ_1, δ_2 and δ_3, as
shown in Extended Data Fig. 8b, and v_F = 3t_0 a_cc/(2ħ) is the Fermi
velocity. The choice of the gauge (A_y = 0) results in δt_2 = δt_3 = δt. We
choose δt_1 = −δt and, finally, the strain modified hopping energies are
given by


(8) 


where Φ_0 = h/e is the magnetic flux quantum.
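
The chain from strain to hopping modulation to vector potential can be illustrated with a short sketch. It is not the authors' implementation: the values of t_0, β and a_cc, the orientation of the nearest-neighbour vectors and the choice of Dirac point K are all assumptions, and the vector potential is evaluated from the standard sum over the three bonds, −(1/(e v_F)) Σ_j δt_j exp(iK·δ_j), using the unstrained bond vectors in the phase.

```python
import numpy as np

A_CC = 0.142e-9             # unstrained C-C bond length (m); assumed value
T0 = 2.8                    # unstrained hopping energy (eV); assumed value
BETA = 3.37                 # decay coefficient; assumed value
HBAR = 1.054571817e-34      # J s
E_CHARGE = 1.602176634e-19  # C
V_F = 3 * T0 * E_CHARGE * A_CC / (2 * HBAR)  # Fermi velocity (m/s)

# Nearest-neighbour vectors (assumed orientation) and the Dirac point K.
DELTAS = A_CC * np.array([[0.0, 1.0],
                          [-np.sqrt(3) / 2, -0.5],
                          [np.sqrt(3) / 2, -0.5]])
K_POINT = np.array([4 * np.pi / (3 * np.sqrt(3) * A_CC), 0.0])


def strained_hoppings(strain):
    """Hopping energies (eV) on the three bonds for a 2x2 strain tensor."""
    bonds = DELTAS @ (np.eye(2) + strain).T      # d_j = (I + strain) delta_j
    lengths = np.linalg.norm(bonds, axis=1)
    return T0 * np.exp(-BETA * (lengths / A_CC - 1.0))


def vector_potential(strain):
    """(A_x, A_y) in T m from the hopping modulations, to first order."""
    dt = strained_hoppings(strain) - T0          # modulation in eV
    s = np.sum(dt * np.exp(1j * (DELTAS @ K_POINT)))
    z = -s / V_F                                 # equals A_x - i A_y (eV / (e v_F) -> T m)
    return z.real, -z.imag
```

With these assumptions, uniaxial strain along x gives a purely x-directed vector potential, while pure shear gives a predominantly y-directed one, as expected for a gauge field proportional to (ε_xx − ε_yy, −2ε_xy).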


Theoretical results for the G/hBN sample 

To simulate the buckling pattern in graphene on hBN, we use a similar
expression as in the case of graphene on NbSe₂, given by equation (1),
but without the last cosine term:

$$B_{\mathrm{PMF}}(x, y) = B\left[\cos(\mathbf{b}_1\cdot\mathbf{r}) + \cos(\mathbf{b}_2\cdot\mathbf{r})\right] \tag{9}$$

with b_1 = (2π/a_y)(0, 1) and b_2 = (2π/a_x)(1, 0). This changes the
symmetry of the unit cell from triangular to rectangular. The profile of the
field is shown in Extended Data Fig. 9a with the unit cell marked by the
dashed rectangle. The size of the unit cell is chosen so as to match the
buckling periods shown in Fig. 5a. The LDOS in the crest region is given
in Extended Data Fig. 9b, where clear LLs can be observed in the
spectra. A cut of the LDOS map at a constant value of B = 62 T (white dashed
line in Extended Data Fig. 9b) shows LLs that scale with the square root
of the LL index (see Extended Data Fig. 9e, g) and result in an effective
field of B_eff = 100 T. Moving towards the transition region (magenta
square in Extended Data Fig. 9a) where B = 0 T, the LDOS spectrum
changes substantially, as shown by Extended Data Fig. 9c. Here, the
spectrum does not show LLs, which is expected since there is no PMF.
However, the LDOS reveals new sets of peaks that depend weakly on
the amplitude of the field. This is further confirmed in Extended Data
Fig. 9d, which shows that the dispersion of these peaks with unit cell
size a_b is consistent with that expected of confinement states. Taking
a cut of the LDOS map from Extended Data Fig. 9c at B = 62 T results
in equidistant peaks separated by about 83 meV, as shown in Extended
Data Fig. 9f, h.
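
The distinction drawn above between a square-root (Landau-level) sequence and an equidistant (confinement) sequence can be made quantitative with a simple comparison of fit residuals. The sketch below is illustrative only: it assumes both models pass through the origin and that the peak indices are non-negative; the function names are hypothetical.

```python
import numpy as np


def fit_through_origin(x, y):
    """Least-squares slope of y = s * x and the r.m.s. residual of that fit."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    slope = np.dot(x, y) / np.dot(x, x)
    rms = np.sqrt(np.mean((y - slope * x) ** 2))
    return slope, rms


def classify_peak_sequence(indices, energies):
    """Return which scaling, 'square-root' or 'linear', describes the peaks better."""
    idx = np.asarray(indices, dtype=float)
    _, rms_sqrt = fit_through_origin(np.sqrt(idx), energies)
    _, rms_linear = fit_through_origin(idx, energies)
    return "square-root" if rms_sqrt < rms_linear else "linear"
```

A sequence of equally spaced peaks, such as the 83-meV spacing quoted above, is classified as linear, whereas peaks following √N are classified as square-root.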


Robustness of the PLLs against disorder 

As seen in Fig. 1c, the rectangular buckling array in the G/hBN sample is
not perfectly periodic as implied by the double cosine potential used
in the simulation. It is then natural to ask how robust the simulated
DOS and the corresponding dI/dV spectrum are against lattice distortions.
To address this question, we carried out numerical simulations where
the periodicity condition was relaxed. As detailed below, we find that
the PMF spectrum of the rectangular buckling structure is quite robust,
and can survive substantial deviations from the perfectly periodic
lattice structure.

To study the effect of relaxing the condition of perfect periodicity,
we introduced a random variation in the unit cell of the PMF within
a predefined range. The procedure is as follows. Since the original
unit cell is too large, we reduced the larger unit vector to the size of
the smaller one, that is, a_x = a_y = 20 nm. The PMF profile for this case
is shown in Extended Data Fig. 10a. Changing the size of the unit cell
should not change the physics of the problem, but numerically this
substantially speeds up our calculations. The system is a circle of
radius 200 nm with absorbing boundary conditions applied at the edges
(the method is described in detail in ref. °°”). A vector potential is added
to the Hamiltonian as described above, with the unit cell period given
by [(1 + AR_x)a_x, (1 + AR_y)a_y], where R_x and R_y are random numbers from a
uniform distribution in the range [−1, 1] and A is the weight parameter.
As an example, in Extended Data Fig. 10b-d we plot a few PMF profiles
using A = 0.15, 0.25 and 0.33, respectively. The calculated DOS is shown in
Extended Data Fig. 10e (the DOS is calculated using the kernel
polynomial method, as explained in ref. *°). The blue curve in this plot shows
the DOS of the perfectly periodic system. The peaks observed here are
the flat bands, and the separation between the peaks is around 60 meV.
As A is increased, the peaks are averaged out and eventually disappear
for large A, which is the expected result. However, up to A = 0.25, the
peaks are still present in spite of the substantial disorder introduced
in the system.
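
One way to generate a field profile with randomly varying periods is sketched below. The text does not specify how the random periods are distributed over the sample, so this sketch makes an assumption: each row and column of cells receives its own period (1 + A R) a, and the cosine phase advances by 2π across each cell. The function names, the number of cells and the seed are hypothetical.

```python
import numpy as np


def disordered_phase(coords, period, weight, rng, n_cells=64):
    """Phase that grows by 2*pi across each cell of random width (1 + weight*R)*period."""
    widths = period * (1.0 + weight * rng.uniform(-1.0, 1.0, size=n_cells))
    edges = np.concatenate(([0.0], np.cumsum(widths)))
    # Piecewise-linear map from position to phase; valid for 0 <= coords <= edges[-1].
    return np.interp(coords, edges, 2 * np.pi * np.arange(n_cells + 1))


def disordered_pmf(x, y, amp, period, weight, seed=0):
    """Two-cosine PMF on a grid (rows: y, columns: x) with disordered periods."""
    rng = np.random.default_rng(seed)
    phi_x = disordered_phase(x, period, weight, rng)
    phi_y = disordered_phase(y, period, weight, rng)
    return amp * (np.cos(phi_x)[None, :] + np.cos(phi_y)[:, None])
```

Setting `weight=0` recovers the perfectly periodic double-cosine profile, so the same routine can be used for both the reference and the disordered cases.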

In addition to the robustness of the PLL spectrum against lattice
disorder, the resemblance between the spectral features of the G/hBN
and G/NbSe₂ samples in the crest regions further indicates their
common origin, as detailed below.

(1) In both systems, the features are observed only in the buckled
regions of the graphene membrane and are absent in the unbuckled
parts of the sample.

(2) In both systems, the buckling gives rise to a strain-induced PMF
resulting in the PLL observed in the dI/dV spectra. The PLL sequence,
which is characterized by a peak at the Dirac point and a square-root
dependence on level index, is observed on the crests, consistent with
the numerical simulation results.

(3) In both systems, the spectra in the regions between maxima of
the PMF show confinement character, rather than PLLs. Specifically,
they do not feature the peak at the Dirac point corresponding to the
N = 0 PLL and the peak sequence does not follow a square-root level
index dependence. Instead the spectrum in this regime is consistent
with magnetic confinement states.


Transition region in the G/hBN sample 

In the case of the G/NbSe₂ sample, the crest and trough regions of the
triangular buckling pattern observed in topography coincide with the
maximum and minimum PMF, respectively. This designation is not as
straightforward in the rectangular buckling pattern observed in the
G/hBN sample. While in this sample the crest regions where the PMF has
its maximum magnitude are readily identified in the topography as the
intersection point between two of the quasi-1D wrinkles that form the
rectangular buckling pattern (Fig. 1c), it is less easy to identify the transition
regions where the PMF is vanishingly small. We instead used the fact that
in the transition region the PMF vanishes, resulting in the disappearance
of the PLLs and the characteristic peak at the CNP. By using this criterion,
we identified the transition region marked by the circle in Fig. 5a.


Data availability 


The data that support the findings of this study are available from the 
corresponding authors on reasonable request.

Extended Data Fig. 1 | Buckling pattern of graphene. a, Large-area STM
topography of G/NbSe₂ shows two ridges that delimit the buckling pattern
(V_b = −0.3 V, I = 20 pA). b, Schematic of wrinkles arising from the compressive
strain at each boundary ridge. Crests form at the wrinkle intersections, marked
by black dots. c, Zoomed-in topography image of the triangular buckling
pattern (V_b = 0.5 V, I = 20 pA) in a. d, The superlattice constant, measured along
the line marked by the blue arrow in c, increases monotonically with distance
from the apex where the two ridges meet. e, Height profile along the green
arrow in a. f, Strain produced by the collapse of the ridges calculated from e,
as described in Methods section ‘Buckling pattern formation’.


Extended Data Fig. 2 | Topography of unbuckled regions in the G/NbSe₂
sample. a, Main panel, STM topography of a flat (unbuckled) region of the
G/NbSe₂ surface far from the ridges. Inset, dI/dV spectrum from the region
shown in the main panel (V_b = 0.5 V, I = 30 pA). b, Atomic-resolution view of
G/NbSe₂ in a (V_b = −0.3 V, I = 30 pA). c, Same as a but from a flat region of
the G/hBN sample. Inset, dI/dV spectra of flat G/hBN for conditions of
electron-doping (green trace) and hole-doping (red trace); V_b = −0.3 V, I = 20 pA.

Extended Data Fig. 3 | Topography of buckled regions in the G/NbSe₂
sample. a, STM topography of a region in the buckled graphene membrane
measured with different bias voltages: 500 mV (left) and 50 mV (right).
b, dI/dV map over an area of size 6 nm × 6 nm in the crest region at the energy of
the N = 0 PLL.

Extended Data Fig. 4 | Transition area in the triangular buckling pattern of
the G/NbSe₂ sample. a, Theoretical contour plot of the LDOS spectra
connecting two crest areas (at x = 0 nm and x = 25 nm; see Fig. 3g, upper panel)
versus energy, E, and position, x. The colour scale bar represents the LDOS
intensity. The yellow dashed line labels the LDOS spectrum in the transition
region plotted in b. b, Experimental dI/dV spectrum in the transition region
between crests and troughs. Green and red arrows indicate the corresponding
peaks in a.

Extended Data Fig. 5 | Evolution of the calculated LDOS with PMF amplitude
for several superlattice periods. Shown are contour plots of the LDOS for
several values of a_b (left to right: 12 nm, 14 nm, 16 nm) in the crest region versus
the amplitude of the magnetic field, B, and energy, E. Dashed lines represent
the field dependence of the PLL energy in a uniform magnetic field given by the
x-axis values.

Extended Data Fig. 6 | Calculated PMF in the crest areas of the buckled
G/NbSe₂ sample. a, LDOS cuts in the crest region from Fig. 3a, upper panel
(A sublattice), for several values of B and corresponding B_eff (first and second
columns in the key). The ratio of the effective PMF obtained from the PLL
spectrum on the crests to the maximum PMF value (B_eff/B_PMF^max) is shown in the
rightmost column of the key. b, Comparison of crest LDOS at B_eff = 112 T for two
lattice constants (14.0 nm and 14.8 nm) in the regime where the N = 0 peaks
merge into one.

Extended Data Fig. 7 | Calculated low-energy band structure and LDOS in the
trough regions of the buckled G/NbSe₂ sample. a, b, Band structure (a) and
LDOS (b) for a PMF of 140 T. c, d, As a, b but for a PMF of 180 T. Calculations
are for a superlattice period of 14 nm.

Extended Data Fig. 8 | Tight-binding model for a strained lattice.
a, Calculated profile of the PMF given by equation (1) for lattice spacing a_b = 14 nm.
b, Schematic of nearest neighbour vectors, δ_1, δ_2 and δ_3, in a graphene lattice.

Extended Data Fig. 9 | PMF for a rectangular buckling pattern in graphene.
a, Profile of the PMF in a rectangular buckling pattern. Dashed rectangle shows
the magnetic unit cell. b, LDOS (sublattice averaged) versus the PMF amplitude
B and energy E at a crest position in a (centre of red region marked by a green
dot). The colour scale bar represents the LDOS intensity in arbitrary units. Note
that the same result is obtained in the centre of troughs (blue regions) where
the PMF sign is reversed. c, As b but for a point with zero field, marked in a by
… cell size. The colour scale bar represents the LDOS intensity. e, Cut of the LDOS
from b for a constant value of the field, B = 62 T, shown by the dashed white line.
f, Cut of the LDOS from c for a constant value of the field, B = 62 T, shown by the
dashed white line. g, Fitting the peak sequence to a square-root dependence on
the PLL index gives an effective PMF of 100 T. h, The peak sequence in f gives a
linear dependence on the peak index with an energy spacing of about 83 meV.

Extended Data Fig. 10 | DOS versus unit cell size in the presence of lattice
disorder. a-d, The PMF profile used for calculating the DOS shown in e. The
unit cell period variation range is given by (1 + A), with a_b = 20 nm and A = 0 (a),
0.15 (b), 0.25 (c) and 0.33 (d). e, DOS obtained for different values of A given in
the key for B = 62 T (as in Extended Data Fig. 9).


The activation of abundant molecules such as hydrocarbons and atmospheric
nitrogen (N₂) remains a challenge because these molecules are often inert. The
formation of carbon-nitrogen bonds from N₂ typically has required reactive organic
precursors that are incompatible with the reducing conditions that promote N₂
reactivity’, which has prevented catalysis. Here we report a diketiminate-supported
iron system that sequentially activates benzene and N₂ to form aniline derivatives.
The key to this coupling reaction is the partial silylation of a reduced iron-dinitrogen
complex, followed by migration of a benzene-derived aryl group to the nitrogen.
Further reduction releases N₂-derived aniline, and the resulting iron species can
re-enter the cyclic pathway. Specifically, we show that an easily prepared
diketiminate iron bromide complex’ mediates the one-pot conversion of several
petroleum-derived arenes into the corresponding silylated aniline derivatives, by
using a mixture of sodium powder, crown ether, trimethylsilyl bromide and N₂ as the
nitrogen source. Numerous compounds along the cyclic pathway are isolated and
crystallographically characterized, and their reactivity supports a mechanism for
sequential hydrocarbon activation and N₂ functionalization. This strategy
couples nitrogen atoms from N₂ with abundant hydrocarbons, and maps a route
towards future catalytic systems.


Reduction or ‘fixation’ of N₂ is accomplished by a few catalytic
systems? § that fully reduce N₂ to ammonia (NH₃)”. It is difficult to
adapt these processes to form C-N bonds from N₂, even though several
functionalizations of N₂ are thermodynamically feasible’®. Efforts
to form C-N bonds from N₂ generally require reduction of N₂ to make
it nucleophilic enough to react with carbon electrophiles””. Notable
examples include synthetic cycles for C-N bond formation”. A frequent
difficulty is the direct reaction of the reducing agents with the
carbon electrophiles.

There would be a substantial advantage if N₂ could be induced to
form a C-N bond with a simple hydrocarbon, without prior
functionalization of the hydrocarbon to convert it into an electrophile. Such a
strategy would require C-H activation of the hydrocarbon, making use
of decades of work on oxidative addition of C-H bonds by low-valent
transition metal complexes. Although C-H functionalization to form
cross-coupling products is well known”, we know of no previous
examples of cross-coupling of C-H bonds with the relatively unreactive N₂
molecule.

Here, we report an iron complex that couples N₂ and unactivated
arenes at low temperature, taking advantage of silyl activation of the
N₂ to produce silylated anilines in a one-pot procedure. The overall
strategy (Fig. 1) begins with C-H bond activation of benzene to form
a phenyl fragment, which could migrate to N₂ upon silylation to form
the key C-N bond. To explore the feasibility of this pathway, we
converted the iron(II) bromide complex [LFe(μ-Br)]₂ (1) into the iron(I)
benzene complex LFe(η⁶-C₆H₆) (2)”°. Figure 2 shows the structure of 2
and illustrates the β-diketiminate ligand L, which controls the
coordination environment of iron”. Reduction of 2 with KC₈ in the presence
of 18-crown-6 (18c6) at room temperature led to the isolable, purple
iron(0) complex LFe(η⁴-C₆H₆)K(18c6) (3-K)?. When 2 was reduced with
Na and 15-crown-5 (15c5) in benzene, the product was the red iron(II)
complex LFe(H)(Ph)Na(15c5) (4-Na), in which a C-H bond of benzene
is broken to give an iron(II) complex with phenyl and hydride groups
on the iron (Supplementary Fig. 50). Thus, 3-K and 4-Na have isomeric
anions (bottom of Fig. 2a). Mössbauer spectra of solid 3-K and 4-Na
(Supplementary Figs. 23-25) support this difference, and density
functional theory (DFT) studies indicate that the molecules have high-spin
electronic configurations (spin states are S = 1 for 3-K and S = 2 for 4-Na).
This is a rare case of room-temperature C-H activation using a high-spin
iron complex???. Reduction of 2 with Na and 15c5 in toluene similarly
gave LFe(H)(Tol)Na(15c5), from activation of the aryl C-H bonds of the
solvent. The X-ray crystal structure refined best to a 61:39 mixture of
meta and para isomers (Supplementary Fig. 54).

Fig. 1 | Strategy for converting benzene and N₂ into silylated aniline without
the use of carbon electrophiles. The reduction steps are not shown here for
simplicity, but are elaborated in Fig. 4. Inset, the iron β-diketiminate fragment
used in this implementation of the strategy.

Fig. 2 | Activation of benzene. a, Reduction of 2 with KC₈/18c6 formed 3-K (left)
and reduction with Na/15c5 formed 4-Na (right). In THF solvent the cations
dissociate, and isomeric anions 3 and 4 are in equilibrium. b, Conversion of 4-Na
to 6-Na occurred spontaneously in 18% yield (top pathway) but gave a higher
yield of 75% with the hydride acceptor BPh₃ (bottom pathway), suggesting that
hydride loss is probably the main reaction pathway.

The solution ¹H nuclear magnetic resonance (NMR) spectra of 3-K
and 4-Na in C₆D₆ showed a few peaks that disappeared over several
hours, suggesting that these benzene or phenyl groups exchange with
C₆D₆. Additionally, a minor component was observed in the ¹H NMR
spectrum of the C₆D₆ solutions of 3-K that resembled those of 4-Na, and
vice versa. These observations suggested that the arene-bound iron(0)
complex 3 and the C-H activated iron(II) complex 4 are in equilibrium
with one another, and the iron(0) arene isomer (3) predominates with
K(18c6)⁺, whereas the oxidatively added iron(II) phenyl hydride
isomer (4) predominates with Na(15c5)⁺ (see Supplementary Tables 7, 8).
This hypothesis was supported by solution studies in tetrahydrofuran
(THF), a solvent that can disrupt the interactions with the alkali metal.
Dissolving either purified 3-K or purified 4-Na in THF gave mixtures
with characteristic peaks of both isomers 3 and 4 in ¹H NMR and
Mössbauer spectra (Supplementary Figs. 9, 26). The ability of each isomer to
produce the other one demonstrates that there is reversible oxidative
addition of the aryl C-H bond of 3 to form 4.

Purple THF solutions of LFe(H)(Ph)Na(15c5) (4-Na) turned green after
standing at room temperature for several hours, and ¹H NMR spectra
showed formation of the iron(I) phenyl complex [LFePh][Na(15c5)]
(6-Na) (Fig. 2b). Although the formation of 6-Na from 4-Na corresponds
to formal loss of H•, the spectroscopic yield of 6-Na from 4-Na in THF
was only 18% (Supplementary Fig. 17), and the best yield of 6-Na (44%)
came from addition of excess 15c5 and Na to 2 in diethyl ether
(Supplementary Fig. 18). These results suggest that the reaction could proceed
through a disproportionation pathway. In a mechanistically revealing
experiment, 4-Na transferred the hydride to triphenylborane (BPh₃)
to give the previously characterized” 5 and [Na(15c5)][HBPh₃]
(Supplementary Figs. 19, 20) in a higher yield of 75%. Therefore, it is possible
that in the presence of excess reducing agent, 4-Na can similarly lose
hydride to give 5, which is subsequently reduced by 3 or by Na to give
the observed 6-Na.

Next, we explored N₂ binding. Cooling a solution of 6-Na in THF under
an atmosphere of N₂ led to changes in the electronic absorption
(ultraviolet-visible; UV-vis) spectrum (Fig. 3a) and the Mössbauer spectrum
(Fig. 3b) that did not occur in control experiments under argon. Van’t
Hoff analysis of the UV-vis data for 6-Na gave ΔH = −17 ± 2 kJ mol⁻¹ and
ΔS = −55 ± 10 J K⁻¹ mol⁻¹ (Supplementary Fig. 40; ΔH is the enthalpy
change, ΔS is the entropy change; uncertainties represent standard
deviations), which are consistent with the binding of N₂. To obtain
crystallographic verification of N₂ binding, we cooled a concentrated
sample of the potassium analogue 6-K at −78 °C for 3 h, which led to
crystals of the N₂ complex 7-K. The X-ray crystal structure of this
potassium salt of 7 (Fig. 4, bottom) demonstrates end-on binding of the
N₂ unit and a pseudotetrahedral geometry at the iron(I) site, and the
spectroscopic similarity to the Na system (Supplementary Figs. 36-39)
indicates that 7-Na has a similar structure.
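
The temperature dependence implied by these thermodynamic parameters can be estimated with a short calculation. The sketch below is illustrative and assumes that ΔH and ΔS are temperature independent and that the activity of N₂ equals 1 in the chosen standard state; the function names are hypothetical, and the uncertainties quoted above are not propagated.

```python
import math

R_GAS = 8.314462618  # gas constant (J K^-1 mol^-1)


def binding_constant(temp_k, delta_h=-17e3, delta_s=-55.0):
    """Equilibrium constant K = exp(-(dH - T*dS) / (R*T)) for N2 binding."""
    delta_g = delta_h - temp_k * delta_s
    return math.exp(-delta_g / (R_GAS * temp_k))


def fraction_bound(temp_k, **kwargs):
    """Fraction of iron present as the N2 adduct, assuming unit N2 activity."""
    k = binding_constant(temp_k, **kwargs)
    return k / (1.0 + k)
```

Under these assumptions the equilibrium constant is close to 1 near room temperature and grows by more than an order of magnitude on cooling to −78 °C (195 K), which is in line with N₂ binding being observed only at low temperature.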

Next, we silylated the N₂ ligand that had been activated through
Fe coordination. Treating a cold solution of 7-K with 2 equivalents
(equiv.) of Me₃SiX (X = Br, I) formed the hydrazido complex
LFe(N(Ph)N(SiMe₃)₂) (8). To our knowledge, the transformation of 7 to 8 is the
first crystallographically verified example of the migration of a
hydrocarbyl group from a metal to the α position of N₂ (refs. °”°). This is the
key C-N bond-forming step during the formation of silylated anilines
(Fig. 1), which differs from the previously used attacks on bound N₂ by
carbon electrophiles.

formation of 7-Na at lower temperatures. The data for 6-K are similar
(Supplementary Fig. 37). b, ⁵⁷Fe Mössbauer spectra at 80 K. The black circles are
the data from samples of 6-K in THF solution frozen under 1 atm Ar (top) and
1 atm N₂ (bottom), demonstrating the formation of a new doublet attributed to
the N₂ complex 7-K (green subspectrum) in addition to some remaining 6-K (red
subspectrum). The blue line is the sum of the green and red fits, and the grey lines
are the residuals between the fits and the data. c, Illustration of the equilibrium of
6 and N₂ to form 7. In a key transformation, silylation of the bound N₂ in 7 induces
migration of the phenyl group to the N₂ fragment, giving 8. When a bulkier silyl
group is used, it is possible to isolate the singly silylated species 9, which can
subsequently form C-N bonds. Each of the iron species is characterized through
X-ray crystallography (Fig. 4, Supplementary Figs. 50-55).

Although we did not detect any intermediates during the conversion
of 7 to 8, the treatment of a cold solution of 7-K with 0.5 molar
equivalents of the bulkier triisopropylsilyl triflate (TIPSOTf; OTf = SO₃CF₃)
gave the formally iron(II) diazenido complex 9, in which the N₂ is singly
silylated while the phenyl group remains bound to the iron
(Supplementary Fig. 55). The conversion of 7 to 9 is accompanied by a similar yield of
the iron(II) complex 5, which results from half of 7 acting as a reducing
agent. Considering this stoichiometry, the formation of 9 occurs in 67%
yield. The isolation of 9, in which the phenyl has not migrated, suggests
that the initial silylation of the β position of the coordinated N₂ in 7
takes place before the migration of the aryl group. It is likely that the
second silylation induces the aryl migration, because a recent study”
reported the migration of H from an iron centre to the α position of a
doubly silylated N₂ group to form an iron disilylhydrazido complex.
Accordingly, addition of excess trimethylsilyl triflate (Me₃SiOTf) and
excess Na to 9 gave a 14% yield of PhN(SiMe₃)₂, showing that addition
of a second silyl group can initiate C-N bond formation. This could be
because the second silylation leads to a formally iron(IV) complex with
an Fe=N double bond, a migration that is reminiscent of alkyl migration
to N in an imidoiron(IV) complex”®.

The conversion of 8 to the silylated aniline and amine occurred upon
the addition of 1 equiv. KC₈ and 2 equiv. Me₃SiX (X = Br, I) to solutions of
8; this treatment led to mixtures containing PhN(SiMe₃)₂,
PhN(SiMe₃)N(SiMe₃)₂ and N(SiMe₃)₃ within 30 min, either at room temperature
or at −100 °C. By contrast, 8 did not react with 2 equiv. Me₃SiBr alone
within 3 days at room temperature. This result suggests that reduction
precedes the electrophilic attack of silyl groups on 8; alternatively, this
final step could involve the formation of Me₃Si• radicals”* °°. Other
studies have also reported on the reductive silylation of disilylhydrazido
complexes*??!.

After release of the silylated nitrogen products from 8, the large
excess of bromide is expected to give iron(II) bromide species that
are poised to be reduced with further arene binding. This suggests
the feasibility of a cyclic process (Fig. 4) in a single pot, which forms
silylated anilines from arenes and N₂. However, the C-H activation and
hydride loss to reform complex 6 requires room-temperature
treatment with Na, and at this temperature Na degrades Me₃SiBr. Further, N₂
binds to 6 at low temperatures. Thus, we treated 8 with Na (25 equiv.),
benzene (20 equiv.), Me₃SiBr (6 equiv.) and 15c5 (5 equiv.) at −100 °C in
diethyl ether, then warmed it to room temperature for 1 h, then cooled
it again and treated it with additional Me₃SiBr (6 equiv.). This led to a
92% yield of PhN(SiMe₃)₂ and a 135% yield of N(SiMe₃)₃ (yields versus Fe).
To verify that the overall process is indeed cyclic, 8 was treated under
the same conditions with added toluene in place of benzene. Analysis
of the organic reaction products after two cycles of Me₃SiBr addition
showed a 163% yield of N(SiMe₃)₃, a 62% yield of PhN(SiMe₃)₂ and a 16%
yield of (tolyl)N(SiMe₃)₂ (both meta and para isomers). The ability of
the phenylhydrazido complex to give products from toluene
amination demonstrates that the iron-containing products of the hydrazido
reduction can activate toluene and continue to another reaction cycle,
although the yields are low.

Fig. 4 | Proposed cyclic reaction mechanism for the conversion of N₂ and
benzene to aniline, mediated by iron β-diketiminate complexes. Thermal
ellipsoid plots of the isolated intermediates (and the K analogue of 3) are shown
at the top and bottom (for ionic compounds, only the anion is shown). Most
hydrogen atoms and isopropyl groups are omitted from the thermal ellipsoid
plots for clarity.

224 | Nature | Vol 584 | 13 August 2020

To test the overall cycle, we reacted the easily prepared iron(II)
complex 1 with 30 molar equivalents (equivalents relative to [Fe]) of Na, 20
equiv. C₆H₆ and 5 equiv. 15c5 at ambient temperature in diethyl ether,
followed by addition of 6 equiv. trimethylsilyl bromide (Me₃SiBr) at
−108 °C, which afforded PhN(SiMe₃)₂ in a yield of 24% per iron atom,
as determined by gas chromatography. The yield of the reaction could
be increased by adding Me₃SiBr in one portion per temperature cycle.
First, a mixture of 1, 35 equiv. Na, 20 equiv. benzene and 5 equiv. 15c5
in diethyl ether was stirred vigorously for 1.5 h at room temperature
until it became green, corresponding to the colour of 6. Cooling the
mixture to −108 °C under 1 atm N₂ resulted in a colour change to dark
red, corresponding to 7. Then, addition of 2 equiv. Me₃SiBr (per iron
atom) to this cold solution and warming to room temperature for 1 h
resulted in another green reaction mixture, suggesting that 6 was
regenerated. Cooling again caused the same colour change to red (7),
and more Me₃SiBr was then added in a second cycle. Repeating 10
cooling-silylation-warming cycles with 2 equiv. Me₃SiBr per cycle gave
a cumulative yield of (85 ± 14)% PhN(SiMe₃)₂ (versus Fe; average and
standard deviation of six trials; Supplementary Table 1). The ability
to produce more product with repeated Me₃SiBr additions suggests a
cyclic process, albeit one in which a substantial amount of the active
species decomposes in each cycle. Although aniline formation was
attenuated with repeated cycles, the yield of N(SiMe₃)₃ continued to
increase with the number of additions of Me₃SiBr, reaching (380 ± 41)%
(versus Fe) (Supplementary Figs. 1, 2). Under the same conditions but
in the absence of benzene, no PhN(SiMe₃)₂ was produced, but a similar
catalytic yield of N(SiMe₃)₃ was observed. Neither product was detected
in the absence of 1. These results suggest that the iron decomposition
products lose the ability to aminate benzene but remain able to silylate
N₂ to N(SiMe₃)₃, a more frequent reaction that has been reported with
other homogeneous catalysts and decomposition products” *°.

Fig. 5 | Aniline products from amination of arenes with N₂. a, Mass spectra of
PhN(SiMe₃)₂ products from the reaction under 1 atm of natural-abundance N₂
(top) and under 1 atm of ¹⁵N₂ (bottom), showing that the nitrogen atom in the
product derives from N₂. b, Yields (relative to Fe) of N-containing products
obtained during N₂-based amination of different arenes, using five cycles of
Me₃SiBr addition (2 equiv. per cycle).

Isotope labelling experiments were used to verify that the aniline
product arises from benzene and N₂. When performing the reaction
under an atmosphere of ¹⁵N₂, gas chromatography coupled with mass
spectrometry indicated the formation of Ph¹⁵N(SiMe₃)₂ and ¹⁵N(SiMe₃)₃
(Fig. 5a), demonstrating that N₂ is the source of the N atoms.
Performing the reaction with C₆D₆ as the arene substrate gave (C₆D₅)N(SiMe₃)₂,
showing that benzene is the source of the phenyl group. The reaction
with an equimolar mixture of C₆H₆ and C₆D₆ gave a 1:1 mixture of
(C₆H₅)N(SiMe₃)₂ and (C₆D₅)N(SiMe₃)₂ (Fig. 5b), but partially deuterated
1,3,5-d₃-benzene gave (C₆H₂D₃)N(SiMe₃)₂ and (C₆H₃D₂)N(SiMe₃)₂ in a
ratio of 2.01 ± 0.05, indicating a normal primary kinetic isotope effect
for the C-H cleavage step. The difference between the intramolecular
and the intermolecular isotope effects" indicates that arene
binding/exchange is not much more rapid than the irreversible step after
C-H cleavage, which is qualitatively consistent with the timescales of
the experiments described above with the 3/4 equilibrium (which is
established over 1-2 h in diethyl ether) and the formation of 6 (which
takes a few hours).
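
The mass shifts that such labelling experiments rely on can be estimated from monoisotopic masses. The sketch below is a worked illustration only: it assumes the molecular formula C₁₂H₂₃NSi₂ for PhN(SiMe₃)₂, uses rounded isotope masses, and does not model fragmentation or the measured spectra; the dictionary and function names are hypothetical.

```python
# Monoisotopic masses (u), rounded; "D" is 2H and "15N" is the heavy nitrogen isotope.
MONOISOTOPIC = {
    "C": 12.0,
    "H": 1.00782503,
    "D": 2.01410178,
    "N": 14.0030740,
    "15N": 15.0001089,
    "Si": 27.9769265,
}


def monoisotopic_mass(composition):
    """Sum the isotope masses for a composition given as {symbol: count}."""
    return sum(MONOISOTOPIC[symbol] * count for symbol, count in composition.items())


# PhN(SiMe3)2 and its 15N- and phenyl-d5-labelled analogues (assumed formulas).
ANILINE = {"C": 12, "H": 23, "N": 1, "Si": 2}
ANILINE_15N = {"C": 12, "H": 23, "15N": 1, "Si": 2}
ANILINE_D5 = {"C": 12, "H": 18, "D": 5, "N": 1, "Si": 2}
```

With these inputs, replacing ¹⁴N by ¹⁵N shifts the molecular ion by about one mass unit and perdeuteration of the phenyl ring shifts it by about five, which is the kind of difference that gas chromatography coupled with mass spectrometry can resolve.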

Other arene substrates were also tested using 5 cycles of Me₃SiBr
addition (Fig. 5c), with a yield of benzene to PhN(SiMe₃)₂ of (68 ± 4)%
per iron. Toluene gave a mixture of (m-tolyl)N(SiMe₃)₂ and
(p-tolyl)N(SiMe₃)₂ in a 3:1 ratio with a total yield of (61 ± 7)% per iron. The overall
yield is similar to that for benzene, and the ratio of isomers is
comparable to that observed in the crystal of LFe(H)(Tol)Na(15c5)
(Supplementary Fig. 54). When o-xylene was used, a (12 ± 2)% yield of
N,N-bis(trimethylsilyl)-3,4-xylidine was observed. These silylated aniline
products could be hydrolysed to the deprotected anilines with weak aqueous
acid when desired. Arenes with easily reducible functionalities, such as
aryl halides, aryl ethers and polycyclic aromatics, did not give aminated
products. Formation of the silylated aniline and N(SiMe₃)₃ also occurred
with other Me₃SiX reagents (X = Cl, I, OTf), although the reaction gave
the highest yields with Me₃SiBr.

The strategy outlined here differs fundamentally from previously
described strategies for the formation of C-N bonds from N2.
Typically, carbon electrophiles have been used to create C-N bonds,
either from N2 (refs. "32-) or nitrides that result from cleavage of N2
(refs. 51”**). In the proposed reaction, silylation plays a key part by
making the coordinated N2 sufficiently reactive to accept the migrating
aryl group from the metal centre. The silylated amines that are
formed can be used in further synthetic steps or can be deprotected
to the parent anilines using mild aqueous acid. In this method, the
C-N bond comes from the migration of a hydrocarbyl from a metal to
an N2-derived group, a strategy that has been used in few stoichiometric
C-N bond formation reactions. The ability of these iron complexes
to generate a hydrocarbyl group on the iron through C-H activation,
and then transfer it to an activated N2, provides a new tactic for coupling
hydrocarbons to N atoms from atmospheric N2, combining the powers
of these bond-cleaving reactions.


Anthropogenic global surface warming is proportional to cumulative carbon
emissions; this relationship is partly determined by the uptake and storage of heat
and carbon by the ocean. The rates and patterns of ocean heat and carbon storage are
influenced by ocean transport, such as mixing and large-scale circulation.
However, existing climate models do not accurately capture the observed patterns of
ocean warming, with a large spread in their projections of ocean circulation and
ocean heat uptake. Additionally, assessing the influence of ocean circulation
changes (specifically, the redistribution of heat by resolved advection) on patterns of
observed and simulated ocean warming remains a challenge. Here we establish a
linear relationship between the heat and carbon uptake of the ocean in response to
anthropogenic emissions. This relationship is determined mainly by intrinsic
parameters of the Earth system: namely, the ocean carbon buffer capacity, the
radiative forcing of carbon dioxide and the carbon inventory of the ocean. We use this
relationship to reveal the effect of changes in ocean circulation from carbon dioxide
forcing on patterns of ocean warming in both observations and global Earth system
models from the Fifth Coupled Model Intercomparison Project (CMIP5). We show that
historical patterns of ocean warming are shaped by ocean heat redistribution, which
CMIP5 models simulate poorly. However, we find that projected patterns of heat
storage are primarily dictated by the pre-industrial ocean circulation (and
small changes in unresolved ocean processes), that is, by the patterns of added heat
owing to ocean uptake of excess atmospheric heat rather than ocean warming by
circulation changes. Climate models show more skill in simulating ocean heat storage
by the pre-industrial circulation compared to heat redistribution, indicating that
warming patterns of the ocean may become more predictable as the climate warms.


The increase in emissions of anthropogenic greenhouse gases such as
CO2 has led to rising atmospheric temperatures since pre-industrial
times. The ocean absorbs most of the excess heat and some of the excess
carbon, mitigating atmospheric warming. However, the storage of
heat and carbon by the ocean leads to adverse regional effects such
as sea-level rise, coral bleaching and ocean acidification. Ocean
transport affects both global and regional ocean temperature and
also carbon changes. In particular, changes in ocean transport can
shape ocean warming patterns. Yet, directly assessing the effects
of changing ocean currents on ocean warming both in observations
and also in climate models remains challenging. In addition, there is
a lack of agreement between modelled and observed heat storage and
also a large spread among different models in the simulated patterns
of ocean heat storage; the causes for these discrepancies are difficult
to attribute. To diagnose the effect of changes in large-scale ocean
circulation (namely, resolved advection) on ocean warming patterns
across observations and model ensembles, we here exploit the intrinsic
similarities between the ocean’s storage of heat and carbon.


Previous studies have shown that changes in global atmospheric
surface temperatures are proportional to cumulative CO2 emissions.
This emergent linear relationship is driven by the ocean, and the
linearity arises in part from the ocean processes that are responsible for
the uptake of heat and carbon (‘uptake’ is the absorption of atmospheric
anomalies by the ocean, and ‘storage’ is how these anomalies are
stored spatially in the ocean). The result highlights a salient connection
between the uptake and storage of heat and carbon in the ocean,
setting the transient climate response to CO2 emissions. Our study aims to
quantify how changes in ocean processes and ocean circulation affect
the ocean storage of heat and carbon at global and regional scales, and
we use our findings to establish the origin of ocean warming patterns.

On the global scale, we find that the ratio of heat to carbon uptake
by the ocean spans a narrow range across observations, models and
scenarios. On the regional scale, patterns of ocean heat and carbon
storage are identical if ocean circulation (where ‘circulation’ herein
refers to resolved advection; see below) is unaltered by anthropogenic
climate change, in both a coupled and an ocean-only model. We use
these two findings to attribute changes in the patterns of ocean heat
storage to ocean circulation changes in an ensemble of coupled climate
models and also in observations. We find that circulation changes have a
dominant effect on observed heat storage patterns. However, the effect
of circulation changes on the patterns of ocean heat storage diminishes
as the climate warms over the twenty-first century.

To obtain our results, we initially use two 100-yr-long simulations
of the Geophysical Fluid Dynamics Laboratory Earth System Model
2 (GFDL ESM2M) forced with a transient increase of 1% atmospheric
CO2 per year (the 1%CO2 experiment; see Methods). We analyse the
results at the time of doubling of atmospheric CO2, averaged over
years 61-80 of the simulation. One simulation allows ocean currents
to evolve freely (the free-circulation experiment) and the other
prescribes monthly mean horizontal Eulerian velocities (ocean currents)
from a pre-industrial control simulation (the fixed-circulation
experiment), removing changes in ocean circulation induced by CO2
forcing. Throughout this text, we define circulation as the advection by
Eulerian velocities only; we therefore do not include parametrizations
such as eddy velocities and mixing, which are allowed to vary in
the fixed-circulation experiment. However, the changes in the eddy
transport and convection in the fixed-circulation experiment are small,
except in ice-covered regions (see Extended Data Fig. 1).

As a result of increased CO2 forcing, excess energy in the climate
system is mainly stored in the ocean. In our numerical experiments, the
ocean heat storage in the fixed-circulation experiment (the added heat,
H_ad) represents the storage of excess atmospheric heat as a result of
pre-industrial ocean circulation and the changes in sub-grid processes
(mixing and mesoscale eddies) owing to increased CO2 forcing. The
ocean heat storage in the free-circulation experiment, H, additionally
includes the effect of circulation changes. The difference between the
total heat storage H and the added heat H_ad (both owing to
anthropogenic climate change), here defined as the redistributed heat, H_r,
is thus the redistribution of both the added and pre-industrial (background)
heat throughout the ocean by the changing ocean circulation:


$$H = H_{\mathrm{ad}} + H_{\mathrm{r}} \qquad (1)$$


Heat redistribution from circulation changes can strongly alter
regional patterns of heat storage, and local climate feedback such
as from clouds and albedo; however, heat redistribution has little effect
on global anthropogenic ocean heat uptake: the difference in global
ocean heat uptake between the fixed-circulation and free-circulation
experiments is only 4% in ESM2M (this could vary slightly from model
to model). A decomposition similar to equation (1) can be applied to
any tracer anomaly in the ocean, including the anthropogenic ocean
carbon storage, C_ant. We here refer to ‘anthropogenic’ heat and carbon
as the anomalies from the pre-industrial state induced by increased
atmospheric CO2 concentration. All anomalies are calculated using
20-yr temporal averages to minimize the effects of natural variability.
Given that heat and carbon anomalies are concentrated in the upper
ocean, and for comparison with observations, we focus on the top
2,000 m vertically integrated fields. To verify our results from ESM2M,
we also conduct similar simulations in an ocean-only model, here the
Massachusetts Institute of Technology General Circulation Model
(MITgcm), with free-circulation and fixed-circulation experiments,
discussed in Methods.
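
The averaging, depth integration and differencing steps described above can be sketched in a few lines. This is a minimal illustration in plain Python, not the processing code used in the study: the function names, the representation of the water column as a list of layer thicknesses and the toy inputs are all assumptions made for the example.

```python
def time_mean(series, start, stop):
    """Mean of yearly values over years start..stop (1-based, inclusive)."""
    window = series[start - 1:stop]
    return sum(window) / len(window)


def depth_integral(profile, thickness, max_depth=2000.0):
    """Integrate a per-volume profile over model layers down to max_depth (m).

    profile   -- anomaly in each layer (for example J m^-3), top to bottom
    thickness -- layer thicknesses in metres, same order
    """
    total, top = 0.0, 0.0
    for value, dz in zip(profile, thickness):
        if top >= max_depth:
            break
        # Only count the part of the layer that lies above max_depth.
        total += value * min(dz, max_depth - top)
        top += dz
    return total


def redistributed_heat(h_free, h_fixed):
    """Equation (1) rearranged: H_r = H - H_ad, grid cell by grid cell."""
    return [h - h_ad for h, h_ad in zip(h_free, h_fixed)]
```

Here `h_free` and `h_fixed` stand for the depth-integrated, time-averaged heat anomalies of the free-circulation and fixed-circulation experiments, respectively; a call such as `time_mean(series, 61, 80)` would give the average over years 61-80.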


Heat and carbon storage in GFDL ESM2M 


In our ESM2M 1%CO2 simulations, C_ant is concentrated in the mid
and high latitudes, and is similar in both the free-circulation and
fixed-circulation experiments, with a spatial correlation coefficient
(CR) of 0.98 (Fig. 1a, b). The ocean redistribution of C_ant is negligible
compared to the total C_ant (ref.’). The added heat in the fixed-circulation
experiment is also predominantly stored in the mid and high latitudes
(Fig. 1e). In the fixed-circulation experiment, added heat is absorbed
by the ocean as a result of air-sea exchange and is transported in the
interior by background circulation and sub-grid processes. This heat
storage H_ad is dominated by physical processes such as Ekman pumping
and vertical mixing, and is heavily influenced by the pre-industrial
stratification. However, in the free-circulation experiment, there is
noticeably more heat stored in the low latitudes of the Atlantic Ocean
compared to in the fixed-circulation experiment, owing to the changing
circulation (Fig. 1d), either a change in gyre transport or overturning
circulation. Redistribution of heat (H_r) from the changing circulation
is evident in all ocean basins, not just in the Atlantic Ocean. The patterns
of C_ant in the free-circulation and the fixed-circulation experiments
closely match H_ad, with CRs of 0.92 and 0.93, respectively. Additionally,
this similarity is evident in ocean-only MITgcm simulations, and also
evident when comparing reconstructions of ocean heat and carbon
storage using a constant ocean circulation.
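
A spatial correlation coefficient of the kind quoted above can be computed as a weighted Pearson correlation between two maps. The sketch below is one plausible implementation, not necessarily the one used here: weighting by grid-cell area is an assumption (the text does not say how CR is weighted), and the function name is hypothetical.

```python
import math


def pattern_correlation(field_a, field_b, weights):
    """Weighted Pearson correlation between two flattened 2-D fields.

    weights -- one non-negative weight per grid cell, for example cell area
               (assumed here; equal weights give the ordinary correlation).
    """
    wsum = sum(weights)
    mean_a = sum(w * a for w, a in zip(weights, field_a)) / wsum
    mean_b = sum(w * b for w, b in zip(weights, field_b)) / wsum
    cov = sum(w * (a - mean_a) * (b - mean_b)
              for w, a, b in zip(weights, field_a, field_b))
    var_a = sum(w * (a - mean_a) ** 2 for w, a in zip(weights, field_a))
    var_b = sum(w * (b - mean_b) ** 2 for w, b in zip(weights, field_b))
    return cov / math.sqrt(var_a * var_b)
```

Applied to maps of C_ant and H_ad on the same grid, a value close to 1 indicates that the two storage patterns differ mainly by a scale factor.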

The similarity of C_ant and H_ad is due to the effect of chemistry on ocean
carbon uptake, through the Revelle buffer factor. The spatial pattern
of the buffer factor is largely determined by the pattern of the
background ocean temperature (see Methods). By contrast, ocean
uptake of other tracers such as chlorofluorocarbon (CFC) lacks chemical
buffering, such that the ocean CFC storage does not have the same
relationship with H_ad as C_ant does; H_ad and C_ant therefore have a unique
similarity (see Methods for a detailed analysis).


Global heat-carbon coupling 


Owing to the unique similarity between the spatial patterns of storage
of C_ant and H_ad in the simulations, the spatial pattern of H_ad can be
reconstructed using C_ant in the free-circulation experiment, without
the need for a fixed-circulation experiment. To reconstruct H_ad, we
require an appropriate time-varying scaling parameter, α(t), such that
H_ad = α(t)C_ant. Here, the coefficient α is called the heat-carbon coupling
parameter. α can be estimated locally using a regression of C_ant onto H_ad,
although this still requires a fixed-circulation experiment. Although the
patterns of ocean heat storage differ between the free-circulation and
fixed-circulation experiments, the global integrals of anthropogenic
heat and carbon content are nearly unchanged (Fig. 2a). Therefore,
we opt to estimate α globally such that


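$$\alpha(t) = \frac{\oint H(x, y, t)\,\mathrm{d}A}{\oint C_{\mathrm{ant}}(x, y, t)\,\mathrm{d}A} = \frac{\mathcal{H}(t)}{\mathcal{C}_{\mathrm{ant}}(t)}, \qquad (2)$$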


where $\mathcal{H}$ and $\mathcal{C}_{\mathrm{ant}}$ are the global spatial integrals of H and C_ant,
respectively, and A is the surface area of the Earth. The value of α is generally
in the range 2 × 10^7 to 6 × 10^7 J mol^-1 when calculated for different model
simulations, scenarios and observations (Fig. 2a). This relatively narrow
range of values for α represents a tight coupling between the
global ocean anthropogenic uptake of heat and carbon in the climate
system.
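
The global estimate in equation (2) amounts to a ratio of two area-weighted sums. The following sketch shows the arithmetic under simple assumptions: fields are flattened lists of depth-integrated storage per unit area (J m^-2 for heat, mol m^-2 for carbon), and `cell_area` holds the matching grid-cell areas in m^2. The function names and this data layout are illustrative choices, not the study's code.

```python
def global_integral(field, cell_area):
    """Area-weighted sum of a per-unit-area field over all grid cells."""
    return sum(f * a for f, a in zip(field, cell_area))


def heat_carbon_coupling(heat, carbon, cell_area):
    """Equation (2): alpha = global heat storage / global carbon storage.

    heat      -- depth-integrated heat anomaly per cell (J m^-2)
    carbon    -- depth-integrated anthropogenic carbon per cell (mol m^-2)
    cell_area -- grid-cell areas (m^2)
    Returns alpha in J mol^-1.
    """
    return global_integral(heat, cell_area) / global_integral(carbon, cell_area)
```

Because both integrals run over the same cells, any land or ice mask should be applied identically to `heat`, `carbon` and `cell_area` before calling the function.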

To understand the leading terms that set the magnitude of α, denoted
as α̂, we consider how atmospheric CO2 changes drive the anthropogenic
ocean uptake of heat and carbon. Using the analytic expressions
for the dependence of $\mathcal{H}$ and $\mathcal{C}_{\mathrm{ant}}$ on atmospheric CO2 (derived in
Methods), we can express α̂ as a function of simple parameters of the Earth
system. We assume that ocean warming $\mathcal{H}$ is driven by the radiative
forcing imbalance that stems from the strength of CO2 as a greenhouse
gas, F_0 (in W m^-2). In addition, we assume that changes in ocean carbon
uptake are directly driven by excess atmospheric CO2 concentration
and that $\mathcal{C}_{\mathrm{ant}}$, the amount of anthropogenic carbon absorbed by the
ocean, is set by the capacity of the ocean to buffer CO2 anomalies,
expressed by the Revelle buffer factor, R_DIC (DIC, dissolved inorganic
carbon). A lower value of R_DIC and a higher total ocean carbon inventory
I_o can both contribute to a lower capacity for the ocean to absorb
additional CO2. We combine the relationships for the uptake of ocean heat
Fig. 1 | Simulated anthropogenic changes in C_ant and H. a-f, Upper 2,000 m
vertically integrated anomalies in years 61-80 of the 1%CO2 ESM2M simulation
for tracers C_ant (a-c) and H (d-f). a, d, C_ant and H relative to the tandem control
simulation for the free-circulation experiment. b, e, Added C_ant and added
H (H_ad) relative to the tandem control simulation fixed-circulation


and carbon and, by neglecting climate feedback, we obtain an
expression for α̂ given by (see Methods):


$$\hat{\alpha}(t) = \frac{R_{\mathrm{DIC}} F_0}{I_o}\, A\, \Delta t, \qquad (3)$$


where Δt is the time interval elapsed (see Methods for further details).
In ESM2M, the estimated α (α̂) determined by equation (3) is of similar
magnitude to the directly obtained α (equation (2)), particularly after
the first 20 yr of simulations (during which time natural variability
dominates over the anthropogenic response; see Extended Data Fig. 2
and Methods). However, after a few decades, observations and models
show little temporal dependence in α (Fig. 2b and Extended Data Fig. 2),
probably owing to the importance of the feedback processes that are
neglected in equation (3). The observed and modelled constant slope
of $\mathcal{H}$ versus $\mathcal{C}_{\mathrm{ant}}$, that is, α (Fig. 2b), therefore highlights that the ocean
takes up heat at the same rate as it takes up carbon. This relationship
is similar to the linear relationship between atmospheric warming and
cumulative carbon emissions. Thus we determine that, to first order,
the linear relationship between the global uptake of heat and carbon
is determined by the capacity of the ocean to take up carbon and the
strength of the radiative forcing of CO2.
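
As a rough plausibility check on this first-order scaling, the sketch below evaluates an estimate of the form α̂ = R_DIC F_0 A Δt / I_o. The functional form is taken as an assumption for the purpose of the example, and every input is an illustrative round number rather than a value from this study: a buffer factor of 10, a forcing strength of 5.35 W m^-2, an Earth surface area of 5.1 × 10^14 m^2, an ocean carbon inventory of about 38,000 Pg C and an interval of 70 yr.

```python
SECONDS_PER_YEAR = 365.25 * 24 * 3600
MOLAR_MASS_C = 12.011  # g mol^-1


def alpha_hat(r_dic, f0, area, inventory_mol, dt_seconds):
    """First-order heat-carbon coupling estimate (assumed form), in J mol^-1.

    r_dic         -- Revelle buffer factor (dimensionless)
    f0            -- radiative forcing strength of CO2 (W m^-2)
    area          -- surface area of the Earth (m^2)
    inventory_mol -- total ocean carbon inventory (mol)
    dt_seconds    -- time interval elapsed (s)
    """
    return r_dic * f0 * area * dt_seconds / inventory_mol


# Illustrative inputs only; none of these numbers is taken from the study.
inventory = 38_000e15 / MOLAR_MASS_C  # Pg C converted to g, then to mol
estimate = alpha_hat(10.0, 5.35, 5.1e14, inventory, 70 * SECONDS_PER_YEAR)
print(f"alpha_hat = {estimate:.1e} J mol^-1")
```

With these inputs the estimate is of order 10^7 J mol^-1. The linear growth with Δt in this form is also the time dependence that the feedback processes mentioned above would need to offset.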


Past and future heat redistribution 


By exploiting this relationship between global heat and carbon uptake,
H_ad can be locally reconstructed from C_ant using H_ad(x, y, t) = α(t)C_ant(x, y, t),
where α(t) (equation (2)) is obtained for each individual simulation or
observational product. The global-mean error in ice-free regions for
the estimated H_ad over the upper 2,000 m of the ocean, compared to
the simulated H_ad in ESM2M, is 11%. We obtain similar errors using
different depth integrals (shown in Fig. 3) or using an ocean-only model
(see Methods). We can then infer vertically integrated patterns of
redistribution, H_r(x, y, t), as residuals using equation (1). The reconstruction
of depth-dependent profiles of zonal-mean heat redistribution is just
as accurate below 300 m (Extended Data Fig. 3).
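
The reconstruction described above reduces to two element-wise operations once α is known. The sketch below assumes flattened maps of depth-integrated heat and anthropogenic carbon on a common grid; the function names are hypothetical and the code illustrates the arithmetic only.

```python
def added_heat(alpha, carbon):
    """Reconstructed added heat: H_ad(x, y) = alpha * C_ant(x, y)."""
    return [alpha * c for c in carbon]


def redistributed_heat_from_carbon(heat, alpha, carbon):
    """Residual of equation (1): H_r = H - alpha * C_ant, cell by cell."""
    return [h - alpha * c for h, c in zip(heat, carbon)]
```

Positive values of the residual mark cells where circulation changes have added heat on top of the passive uptake, and negative values mark cells that have been cooled by redistribution.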

We apply our method to observational products using spatially and 
temporally varying ocean heat storage (from ref.”3) and anthropogenic 
experiments. c, f, The difference in C_ant and H between the free-circulation and
fixed-circulation experiments, which is attributed to the changing ocean
currents, that is, the redistributed values of C_ant and H (that is, H_r). The black
line shows the annual-mean maximum sea-ice extent.


carbon (from ref.”), from 1951 to 2011 in the top 2,000 m of the ocean.
The storage of anthropogenic heat and carbon is calculated by fitting
linear trends locally over the period, multiplied by the time period
considered (Fig. 4). We apply the same procedure to 11 Earth system
models from the CMIP5 archive with a full carbon cycle. Although
there is a large inter-model spread in historical ocean heat storage
among CMIP5 simulations (±71%), the CMIP5 ensemble mean agrees
well with observations of net global ocean heat storage over the past
40 yr. We use historical and climate change simulations (following
the Representative Concentration Pathway 8.5, RCP8.5) for 1951-2011
and 2011-2060, respectively. We also reconstruct projections of
redistributed heat using the 2011-2060 trends from CMIP5 in the RCP8.5
simulations (Fig. 4).
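
The trend-based storage estimate described above (a local linear trend multiplied by the length of the period) can be sketched as follows. Ordinary least squares is assumed for the fit, which the text does not specify, and the function name is hypothetical.

```python
def linear_trend_change(years, values):
    """Change over the record implied by a least-squares linear trend.

    years  -- sample times (for example 1951, 1952, ..., 2011)
    values -- storage at one grid cell at those times
    Returns slope * (last year - first year), in the units of `values`.
    """
    n = len(years)
    mean_t = sum(years) / n
    mean_v = sum(values) / n
    slope = (sum((t - mean_t) * (v - mean_v) for t, v in zip(years, values))
             / sum((t - mean_t) ** 2 for t in years))
    return slope * (years[-1] - years[0])
```

Applying the function cell by cell to the heat and carbon records gives the maps of H and C_ant that enter the reconstruction; using the fitted trend rather than the difference between end points reduces the influence of individual anomalous years.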

Overall, the CMIP5 models reproduce the observed 1951-2011 added
heat storage better than they do the redistributed heat storage; the CR
for the observed and simulated mean H_ad is 0.83, whereas the CR for
observed and simulated H_r is 0.31. However, some features in the spatial
patterns of H_r over 1951-2011 are consistent between observations and
the CMIP5 simulations, as well as in our ESM2M 1%CO2 simulation;
for example, cooling in the Pacific sub-tropical gyres and the Indian
Ocean, and warming in the tropics. However, the ensemble mean of the
CMIP5 simulations underestimates the strength of tropical redistributed
warming compared to the observations, especially in the Atlantic
Ocean. This is partly due to inter-model variability. In the Southern
Ocean, the observed patterns of warming show substantial contributions
from added heat in all three sectors of the Southern Ocean: large
redistributed warming in the Atlantic and Indian sectors, and weak
cooling in the Pacific sector. The zonally integrated heat storage in
the Southern Ocean is dominated by the added heat, owing in part to
the integration both over the redistributed warming anomalies (in the
Atlantic and Indian sectors of the Southern Ocean) and also over the
cooling anomalies (in the Pacific sector). The CMIP5 models simulate
more redistributed cooling in the Pacific sector of the Southern Ocean
compared to observations over the same period, but simulate a weaker
redistributed warming feature in the Atlantic and Indian sectors, seen
also in the Atlantic zonal mean (see Extended Data Fig. 4).

A key feature of H_r inferred from the observations is the cooling effect
of the redistributed heat in the North Atlantic subpolar gyre, consistent
Fig. 2 | Heat-carbon coupling. a, α calculated from (left to right): the 1951-2011
trend in the CMIP5 ensemble historical simulations (ensemble mean in red); the
1951-2060 trend in the CMIP5 ensemble RCP8.5 simulation (ensemble mean in
orange); the 1951-2011 trend in observations of ocean heat content (white
circle from ref. °°, filled circle from ref. 7°), and in observations of ocean
carbon content (mean, blue); our ESM2M 1%CO2 model after 70 yr of the
fixed-circulation (dark green) and free-circulation (light green) experiments;
and from the perturbed MITgcm simulation after 70 yr (magenta). The grey
dots show individual estimates; the green circles show ESM2M within the
CMIP5 ensemble. The error bars show the 66% confidence interval stemming
from the combined observational uncertainty in $\mathcal{H}$ and $\mathcal{C}_{\mathrm{ant}}$. b, $\mathcal{H}$ as a function of


with a reduction in the meridional ocean heat transport that is probably
due to a reduction in the meridional overturning circulation or gyre
transport, which counteracts the warming from the added heat. This
North Atlantic redistribution is not captured in the CMIP5 ensemble
mean but is present in a few of the ensemble members (Extended Data
Fig. 5). This could indicate either that the sub-polar cooling is due to
natural variability (an anthropogenic signal in meridional heat transport
only emerged in 2000) and therefore eliminated owing to the
averaging over multiple historical simulations, or that most CMIP5 ESM
models fail to capture the emergence of a forced trend in that region
because they contain structural biases.

In our projections from 2011 to 2060, in all CMIP5 models (including
in the three CMIP5 models that best match the observed patterns
of H_r from 1951 to 2011), H_r exhibits a cooling signal in the Atlantic
sub-polar gyre and warming in the Gulf Stream and in low-latitude
regions (Extended Data Fig. 5). This agreement could indicate that
the observed pattern of ocean redistribution from anthropogenic
climate change might persist over the coming decades, despite the
$\mathcal{C}_{\mathrm{ant}}$ relative to the years 1870-1890, where the slope is equal to α. The CMIP5
historical-RCP8.5 ensemble members are given up to the year 2060, where the
ensemble mean is the thick solid pink line, the range is shaded in pale yellow
and the thin solid pink lines show individual models. The dashed lines show the
CMIP5 models that simulate a decrease in the ocean heat content in the
twentieth century, which are excluded from the mean and range. The blue line
shows observations (ocean heat storage from ref.*°), and the green shading
shows the range of the fixed-circulation and free-circulation 1%CO2 ESM2M
simulations that we conduct here. The solid circles show the decadal means,
and the white rings highlight the 2000-2009 mean. The blue shading shows
the 66% confidence interval from observations.


past influence of natural variability. However, it is important to note 
that model agreement does not necessarily mean that the models 
are correct. 

Heat redistribution explains most of the warming pattern over the
historical period: H_r has a CR of 0.92 with H in observations (CR = 0.87
in CMIP5). However, the projected added heat becomes relatively more
important between 2011 and 2060, with a CR of 0.52 for H_ad with H, as
opposed to CR = 0.27 in observations and CR = 0.26 in CMIP5 over the
historical period. As the climate warms, added heat storage will therefore
probably begin to shape ocean warming patterns.


Discussion 

We identify an emergent coupling between the global ocean uptake of 
heat and carbon. The order of magnitude of the heat-carbon coupling 
parameter is consistent across models, scenarios and observations 
and is mostly constant over time. The magnitude of the heat-carbon 
coupling parameter is, to first order, set by the ocean buffer factor, 
Fig. 3 | Error in the redistributed ocean heat storage from ESM2M.

a-i, Ocean heat storage from the fixed-circulation experiment in terms of the
added H (H_ad), depth-integrated over 0-1,000 m (a-c), 0-2,000 m (d-f), and
0-5,000 m (g-i), averaged over years 61-80 of the ESM2M 1%CO2 simulation.
a, d and g show H_ad estimated using only the free-circulation experiment


the pre-industrial ocean carbon inventory and the radiative forcing
strength of CO2. This similarity is unique in that it is not shared with
other passive tracers such as ocean CFC storage. We use the global
and regional coupling of heat and carbon to calculate patterns of
the ocean heat redistribution that arises from circulation changes in
observational-based products and in CMIP5 models from 1951 to 2060.

The method allows us to attribute observed ocean warming and
cooling to circulation changes on a global scale using only
observational products, thereby avoiding the biases in climate models
that are associated with their simulated heat-storage patterns. In
some ocean basins, the effect of warming from redistribution is more
than three times larger than the effect of added heat storage in the
fixed-circulation experiment, leading to an increased rise in regional
sea levels. We find that warming from redistribution has had an important
role in the observed warming of the Atlantic Ocean between 40°S
and 40°N and in the eastern tropical Pacific Ocean, up to 6.4 GJ m^-2
along the Gulf Stream. In contrast to warming from redistribution,
changes in ocean circulation have probably cooled the sub-tropical
gyres in both the Indian and Pacific Oceans by up to 2.4 GJ m^-2, leading
to mitigation of local sea-level rise. In the Southern Ocean, there is
redistributed warming in the South Atlantic and Indian sectors, up to
4.0 GJ m^-2, but cooling in the Pacific sector, up to 1.7 GJ m^-2, such that
the zonally integrated Southern Ocean warming is set by the added
heat. Over the historical period, some of these observed features
are probably due to natural variability. However, both the Pacific
and Indian sub-tropical redistributed cooling and Southern Ocean
pattern are projected to continue into the future across all models,
suggesting that the historical pattern of redistribution may persist
over the next several decades.
quantity αC_ant, and b, e and h show H_ad simulated by the fixed-circulation
experiment. c, f and i show the difference between H_ad as determined by the two
experiments. The average percentage errors in the estimation of H_ad using C_ant
are 3%, 11% and 12% for the models over 0-1,000 m, 0-2,000 m and 0-5,000 m,
respectively.


Simulations of the storage of ocean anthropogenic carbon (and by
extension the added heat storage) within the CMIP5 ensemble agree
well with each other. Therefore, the differences in heat-storage
patterns simulated in the CMIP5 models are not due to the background
state, which governs the added heat storage, but are predominantly
due to the response of the ocean circulation to forcing
in these models (natural or anthropogenic). In future projections,
however, the influence of ocean heat redistribution on patterns of
heat storage is reduced as the added heat becomes more dominant,
despite changes in ocean circulation becoming more severe. In
addition, the patterns of added heat storage in CMIP5 models agree
well with observations. As added heat storage becomes more dominant,
future CMIP simulations of ocean heat storage will therefore
probably agree better with real patterns of warming than in the past.
As a result, long-term ocean warming and patterns of sea-level rise
may become more predictable as the climate warms. However, we
cannot exclude that at even higher atmospheric CO2 levels than
tested here, extreme and abrupt changes in circulation may have
an important role.

A main caveat of our work is the use of a single coupled climate
model (ESM2M) to test our method. However, all key results found
using ESM2M are also found in ocean-only simulations with MITgcm.
In the MITgcm simulations, we also find a 4% difference in global heat
uptake between the fixed-circulation and free-circulation experiments
after 100 yr of simulation, that the redistribution of C_ant is negligible,
that patterns of carbon and heat storage in the fixed experiment are
highly correlated, that α lies in the range of other coupled models, and
that all of these results can be used to skilfully reconstruct fixed heat
storage (see Methods). In addition, both the spatial similarity of C_ant
Fig. 4 | Redistribution of ocean heat storage in the upper 2,000 m.

a, d, g, The change in heat storage H. b, e, h, The change in the added
heat storage H_ad, calculated as αC_ant. c, f, i, The ocean redistributed heat
H_r = H - αC_ant. The changes are shown as the linear trends over the indicated
periods multiplied by the length of the period. The results from observational
estimates (obs.) for the period 1951-2011 are given in a-c; the results from

under free-circulation and fixed-circulation conditions, and also
the small effect of biological changes on C_ant, are consistent with
previous modelling work. In the fixed-circulation experiment,
parameterizations are still allowed to vary. However, the effect of changes
in these parameterizations on the passive heat and carbon storage is
found to be less than 10% (see Methods). Finally, the similarity between
H_ad and C_ant is also present when comparing passive reconstructions of
the storage of ocean heat and carbon. Another caveat of our work is
that many models tend to underestimate variability, including air-sea
carbon fluxes. However, because we consider the multi-decadal
time-integrated carbon storage, the effects of variability in the flux
are mostly mitigated.

In summary, we present a method to calculate the redistribution
of ocean heat without the need for any additional measurements or
simulations, and we infer changes in heat transport owing to
anthropogenic climate change. The method provides a way to assess the quality
of climate models, by comparing the physical response to forcing with
our observational estimate of ocean heat redistribution. We show that
ocean heat redistribution, although dominant in the past, becomes
less important in the future, leading to potentially more predictable
warming patterns in coarse-resolution climate models. Further work
is necessary to assess our results in eddy-rich ocean climate models,
and to constrain the mechanisms and sensitivity of ocean heat to carbon
uptake and their potentially strong link to the transient climate
response to CO2 emissions. However, our method can be used to
understand the mechanisms that regionally redistribute ocean heat,
impacting sea-ice formation, marine heat waves, ocean oxygen
and sea-level rise.
1951-2011 changes in the CMIP5 historical-RCP8.5 ensemble are given in d-f;
and g-i show the 2011-2060 changes in the CMIP5 RCP8.5 ensemble.

The hatching indicates where the changes are not significant at the 66%
uncertainty level. For the observations, the uncertainty is from the estimates
of H and C_ant, whereas for the CMIP5 models, the uncertainty is from the mean of
the ensemble that emerges from the inter-model spread.
Methods


Experimental design 

The main coupled climate model used in this study is the GFDL ESM2M,
which includes a fully coupled carbon cycle forced with a transient
increase of 1% atmospheric CO2 per year. To investigate the effect of
changing currents, in addition to a free-running forced experiment,
we use the fixed-circulation experiments from ref. ’”. In these
experiments, the ocean currents (namely, the horizontal velocities) are
prescribed to follow a fixed seasonal cycle. The prescribed (fixed)
currents are taken from the first 100 yr of the 1860 control
simulation. To maintain the mass balance and avoid unrealistic local sea
levels, the ocean surface freshwater flux in the 1%CO2 fixed-circulation
experiment is also prescribed from the pre-industrial fixed-circulation
control. Although the resolved currents are prescribed, the
parameterizations for subgrid-scale mesoscale eddy mixing and convection
are allowed to operate as normal and respond to the imposed
CO2 forcing.

The experiments are run for 100 yr but herein we show the spatial 
maps of anomalies over the years 61-80, which coincide with the time 
at which the atmospheric CO, concentration is twice the pre-industrial 
value. In the ESM2M simulations, the difference in the net ocean heat 
uptake for the average of the years 61-80 between the fixed-circulation 
and free-circulation simulations is 4%. 
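
As a quick consistency check on the choice of years 61-80, a compounding increase of 1% per year doubles the starting concentration after ln 2 / ln 1.01 years. The short Python sketch below evaluates this; the function name is illustrative and is not taken from the model code.

```python
import math


def doubling_time_years(growth_rate_per_year: float) -> float:
    """Return the years needed for a compounding quantity to double."""
    return math.log(2.0) / math.log(1.0 + growth_rate_per_year)


# 1% per year compounding, as in the 1%CO2 experiment
t_double = doubling_time_years(0.01)
print(f"CO2 doubles after about {t_double:.1f} yr")
```

The result falls just below year 70, near the centre of the 61-80 averaging window.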

In the fixed-circulation experiment, there are some changes in the 
circulation owing to the eddy parameterization schemes (Gent-McWil- 
liams and vertical mixing), shown in Extended Data Fig. 1. However, 
these changes are negligible. Extended Data Fig. 1 also shows the mixed 
layer depth in the control simulation and calculated changes in the 
mixed layer depth in the free-circulation and fixed-circulation experi- 
ments. Changes in the mixed layer depths reflect changes in convec- 
tion. Although there are large changes in mixed layer depths in the 
free-circulation experiment in the North Atlantic (particularly in the 
Labrador Sea), there are only small changes in the ice-free regions of 
the fixed-circulation experiment, where convection is maintained. In 
the fixed-circulation experiment, in the North Atlantic north of 30°N, 
there is an average reduction in the mixed layer depth of 13%, where 
mixed layer depths were deeper than 200 m in the control case. These
results show that, outside ice-covered regions, the changes in the eddy 
and convection parameterizations in the fixed-circulation experiment 
are small. 


Similarity between C_ant and H_ad

To demonstrate the similarity between C_ant and H_ad, we investigate the
relationship between heat and CFC storage from an RCP8.5 simulation,
also at the time of the doubling of the atmospheric partial pressure of
CO2, pCO2 (Extended Data Fig. 6). Ocean uptake of atmospheric CO2 is
driven by the air-sea imbalance of the partial pressure of CO2. At the
ocean surface, changes in pCO2 are largely latitudinally uniform
(Extended Data Fig. 6), which is a result of surface waters being mostly
equilibrated with respect to well-mixed atmospheric CO2. Changes
in ocean pCO2, ΔpCO2, therefore follow the nearly latitudinally uniform
atmospheric CO2 changes, imposing a strong constraint on the pattern
of surface pCO2.

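ΔpCO2 can be decomposed into the change in pCO2 from temperature
changes, ΔT, and changes owing to chemistry, using linearity and the
chain rule. In this case, we can express the contribution of chemistry
through the changes in concentrations of DIC, ΔDIC, and of alkalinity
(ALK), ΔALK:

$$\Delta p_{\mathrm{CO_2}}(T, \mathrm{DIC}, \mathrm{ALK}, S) \approx \frac{\partial p_{\mathrm{CO_2}}}{\partial T}\,\Delta T + \frac{\partial p_{\mathrm{CO_2}}}{\partial \mathrm{DIC}}\,\Delta \mathrm{DIC} + \frac{\partial p_{\mathrm{CO_2}}}{\partial \mathrm{ALK}}\,\Delta \mathrm{ALK}, \qquad (4)$$

where the contribution from changes in salinity is negligible compared
to the other factors in equation (4). The ΔDIC term dominates ΔpCO2
(Extended Data Fig. 6). Using the following expressions for the buffer
factors for DIC and alkalinity (ALK), R_DIC and R_ALK, respectively:

$$R_{\mathrm{DIC}} \approx \frac{3 \times \mathrm{ALK} \times \mathrm{DIC} - 2 \times \mathrm{DIC}^2}{(2 \times \mathrm{DIC} - \mathrm{ALK})(\mathrm{ALK} - \mathrm{DIC})}, \qquad (5)$$

$$R_{\mathrm{ALK}} \approx -\frac{\mathrm{ALK}^2}{(2 \times \mathrm{DIC} - \mathrm{ALK})(\mathrm{ALK} - \mathrm{DIC})}, \qquad (6)$$

and an empirically derived relationship for ∂pCO2/∂T, we can rewrite
equation (4) as follows:

$$\Delta p_{\mathrm{CO_2}}(T, \mathrm{DIC}, \mathrm{ALK}) = 0.0423\,p_{\mathrm{CO_2}}\,\Delta T + \frac{R_{\mathrm{DIC}}\,p_{\mathrm{CO_2}}}{\mathrm{DIC}}\,\Delta \mathrm{DIC} + \frac{R_{\mathrm{ALK}}\,p_{\mathrm{CO_2}}}{\mathrm{ALK}}\,\Delta \mathrm{ALK}. \qquad (7)$$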

The sensitivity ∂pCO2/∂DIC is set by the background ocean chemistry,
the spatial pattern of which is partly determined by the background
ocean temperature, as shown in Extended Data Fig. 6. To
determine the role of temperature on ∂pCO2/∂DIC, we remove latitudinal
variations in temperature: we first calculate the pre-formed DIC
(that is, C_sat, the component of DIC that is in equilibrium with the
atmosphere, and the solubility of which is determined by temperature and
salinity) in the case of DIC. The pre-formed DIC was calculated twice,
first using the simulated spatially varying background temperature
field, and then using a uniform temperature field. The difference
between the two is then removed from the actual simulated DIC to
remove the effects of the spatially varying temperature. pCO2 is also
calculated with the same simulated and uniform temperature fields,
and the difference is used to remove the effects of the spatially varying
temperature field.

To satisfy the latitudinally uniform surface ΔpCO2 constraint, ΔDIC
must be anti-correlated to ∂pCO2/∂DIC; in the ESM2M, CR = -0.84, with
similar correlations in other models and observations (Extended Data
Fig. 7). The patterns of ΔDIC, which are dictated by ∂pCO2/∂DIC and
therefore by the background state, are similar in both the
fixed-circulation and free-circulation experiments (CR = 0.98), as
expected.

The patterns of ΔDIC are also found to correlate with the temperature
change in the fixed-circulation experiment (that is, the added
temperature ΔT_ad; CR = 0.64), and with the background temperature
(CR = 0.8). However, concentration changes of CFCs are not strongly
correlated with ΔT_ad. Similar to CO2, the ocean uptake of CFCs is set
by the air-sea imbalance in partial pressure of CFC, p_CFC. Unlike
CO2, however, CFCs lack chemical buffering and the surface p_CFC is
related to the CFC concentration only by the inverse of solubility.
This lack of chemical buffering breaks the correlation between CFC
changes and ΔT_ad.

Once carbon and heat anomalies are absorbed by the ocean at the
surface, the interior C_ant and H_ad patterns are set only by the background
physical processes, with little impact from changes in sub-grid processes,
as illustrated using a 'fixed-climate' simulation in which atmospheric
CO2 has no radiative forcing effect. In this simulation, the ocean
only sees a pre-industrial climate, without physically driven changes in
temperature and ocean circulation. Between the fixed-climate ESM2M
simulation and the fixed-circulation and free-circulation simulations,
C_ant is almost identical (Extended Data Fig. 8), confirming that
temperature and circulation changes have negligible effects on C_ant. The
fixed-climate simulation also shows that changes in biology have little
effect, because modelled biological changes are primarily driven by
physical changes in stratification that do not occur in the fixed-climate
simulation. Lastly, parameterizations such as convection, mixing
and eddy transports are unperturbed in the fixed-climate simulation.
The difference in C_ant between the fixed-climate and fixed-circulation
experiments is less than 10% locally. These simulations therefore also
show that the parameterizations have only a small effect on passive
tracer storage and are consistent with results from previous studies
with other CMIP5 models.

The correlation between ΔDIC and fixed-circulation ΔT_ad at the surface
owing to chemical buffering, as well as the dominance of background
physical processes in setting H_ad and C_ant, explain their strong
correlation. By contrast, the lack of correlation between changes in
surface CFC and ΔT_ad leads to a mismatch in storage patterns (Extended
Data Fig. 6). CFCs therefore cannot be used to reconstruct H_ad, whereas
C_ant uniquely can.


Relationship between global ocean anthropogenic heat and 
carbon 

This section outlines a more detailed derivation of equation (3). We
express the top-of-the-atmosphere radiative imbalance, N(t), as the
imbalance between the radiative forcing from CO2, F(t), and the climate
feedback, λ(t)T_a(t), such that:

$$N(t) = F(t) - \lambda(t)\,T_a(t) = \frac{\mathcal{H}(t)}{A\,\Delta t}, \qquad (8)$$

where ℋ(t) is the global heat uptake H since t = 0, Δt is the time interval
elapsed, A is the surface area of the Earth, T_a(t) is the mean atmospheric
warming and λ(t) is the climate sensitivity parameter. Note
that in effect, Δt = t, but we use this notation to make it clear when a
quantity is evaluated at a time t, and when the time elapsed appears
as a separate quantity Δt. Most of the excess heat is absorbed by
the ocean such that A × N(t) ≈ ℋ(t)/Δt. The radiative forcing owing to
CO2 can be calculated as F(t) = F_0 ln[pCO2(t)/pCO2(0)], where F_0 = 5.35 W m⁻²
is the radiative forcing coefficient for CO2. We can therefore write:

$$F(t) = \lambda(t)\,T_a(t) + N(t) = F_0 \ln\left[\frac{p_{\mathrm{CO_2}}(t)}{p_{\mathrm{CO_2}}(0)}\right]. \qquad (9)$$

We assume linearity of the response in equation (9), as is commonly
done, because we are only interested in an order-of-magnitude calculation,
and nonlinear responses are second order. Combining equations
(8) and (9), we obtain the heat content change over Δt as:

$$\mathcal{H}(t) = A\,F_0 \ln\left[\frac{p_{\mathrm{CO_2}}(t)}{p_{\mathrm{CO_2}}(0)}\right]\Delta t - A\,\lambda(t)\,T_a(t)\,\Delta t. \qquad (10)$$
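
For orientation, the logarithmic forcing law and equation (10) can be evaluated directly. The Python sketch below uses the F_0 value given in the text; the function names are illustrative, and any feedback parameter or warming passed to `heat_uptake` would be a hypothetical placeholder rather than a value from the simulations.

```python
import math

F0 = 5.35  # W m^-2, radiative forcing coefficient for CO2 (from the text)


def co2_forcing(pco2_ratio: float) -> float:
    """Radiative forcing F = F0 * ln(pCO2(t) / pCO2(0)), in W m^-2."""
    return F0 * math.log(pco2_ratio)


def heat_uptake(area_m2, pco2_ratio, feedback, warming_k, dt_seconds):
    """Equation (10): area times (forcing minus feedback) times elapsed time."""
    imbalance = co2_forcing(pco2_ratio) - feedback * warming_k  # N(t), W m^-2
    return area_m2 * imbalance * dt_seconds  # joules


forcing_2x = co2_forcing(2.0)
print(f"Forcing at CO2 doubling: {forcing_2x:.2f} W m^-2")
```

At a doubling of CO2 the forcing is about 3.7 W m⁻², which sets the scale of the first term in equation (10).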

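Neglecting the anthropogenic carbon uptake by the land, and assuming
that pCO2 of the surface of the ocean is in equilibrium with the atmosphere
(which follows the argument that the pattern of surface ocean
pCO2 does not change under climate change, as outlined above), we can
use the definition of the Revelle buffer factor R_DIC:

$$R_{\mathrm{DIC}} = \frac{\Delta \overline{p_{\mathrm{CO_2}}}}{\Delta \overline{\mathrm{DIC}}}\,\frac{\overline{\mathrm{DIC}}}{\overline{p_{\mathrm{CO_2}}}} \approx \frac{\Delta I_a}{\Delta I_o}\,\frac{I_o}{I_a}. \qquad (11)$$

Here, I_o is the pre-industrial global ocean inventory of carbon,
I_o = V × $\overline{\mathrm{DIC}}$ (where V is the volume of the ocean), and I_a is the
atmospheric inventory of carbon. In the formal definition of R_DIC,
$\overline{\mathrm{DIC}}$ is the mean surface ocean DIC concentration. However, the ratio
$\overline{\mathrm{DIC}}/\Delta\overline{\mathrm{DIC}}$ only differs by 1% in ESM2M if we consider the mean of
DIC across the upper 2,000 m instead of the surface of the ocean (and
similarly in the estimate of total and anthropogenic ocean DIC from ref. ).
The overline indicates an average over the ocean area (for pCO2) and over
the volume (of the upper 2,000 m, in our case) for DIC. We can therefore
write:

$$\ln\left[\frac{p_{\mathrm{CO_2}}(t)}{p_{\mathrm{CO_2}}(0)}\right] = \ln\left[1 + \frac{\Delta p_{\mathrm{CO_2}}(t)}{p_{\mathrm{CO_2}}(0)}\right] = \ln\left[1 + \frac{R_{\mathrm{DIC}}\,C_{\mathrm{ant}}(t)}{I_o(0)}\right] \approx \frac{R_{\mathrm{DIC}}\,C_{\mathrm{ant}}(t)}{I_o(0)}, \qquad (12)$$

where C_ant(t) = ΔI_o is total ocean anthropogenic carbon. In our
approximation, I_o(0) = I_o, so we can re-write equation (10) as follows:

$$\mathcal{H}(t) = \frac{R_{\mathrm{DIC}}\,A\,F_0}{I_o}\,C_{\mathrm{ant}}(t)\,\Delta t - A\,\lambda(t)\,T_a(t)\,\Delta t, \qquad (13)$$

leading to the final equation for the linearly approximated heat-carbon
coupling parameter α̃ in a one-dimensional model:

$$\tilde{\alpha} = \frac{\mathcal{H}(t)}{C_{\mathrm{ant}}(t)} = \frac{R_{\mathrm{DIC}}\,A\,F_0}{I_o}\,\Delta t - \frac{A\,\lambda(t)\,T_a(t)}{C_{\mathrm{ant}}(t)}\,\Delta t. \qquad (14)$$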


The left- and right-hand sides of equation (14) are shown in Extended
Data Fig. 2, for the ESM2M 1%CO2 simulation. The first term of the
right-hand side of equation (14), which depends on carbon buffering
effects, captures the magnitude of α, but has too steep a slope.
The second term of the right-hand side of equation (14) acts to flatten
out the behaviour of the coupling parameter with time. To first order,
α therefore does not depend on the global heat uptake ℋ(t) or on C_ant(t)
explicitly, but instead is set by F_0, R_DIC and I_o, which we have
approximated as constants in this simple formulation. The results imply
that the strength of CO2 as a greenhouse gas, the buffer factor and the
pre-industrial ocean carbon content (which is linked to the pre-industrial
ocean stratification), all of which are intrinsic quantities of the
pre-industrial climate system, lead to a tight coupling between ℋ and
C_ant, and as a result a tight coupling between H_ad and C_ant. The
parameters R_DIC and I_o only vary up to 15% across different models,
scenarios and observations, and so the spread in α across models and
observations is due to the T_a term in equation (14), reflecting
differences in non-CO2 forcing agents and variations in various climate
feedback processes. Over the historical period, natural variability in
atmospheric warming trends probably also explains part of the large
spread in α.
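
The two right-hand-side terms of equation (14) can be separated in a few lines of code, which makes it easy to see how the buffering term and the feedback term combine. The Python sketch below is a hypothetical helper (the function name and all input values are placeholders, not model output) and assumes mutually consistent units.

```python
def alpha_terms(r_dic, area, f0, i_o, feedback, warming, c_ant, dt):
    """Return (buffering term, feedback term, estimate) for equation (14).

    No unit conversion is performed; inputs must be mutually consistent.
    """
    buffering_term = r_dic * area * f0 / i_o * dt
    feedback_term = area * feedback * warming / c_ant * dt
    return buffering_term, feedback_term, buffering_term - feedback_term


# Dimensionless placeholder values, chosen only to exercise the formula
buffering, feedback_part, alpha_estimate = alpha_terms(
    r_dic=10.0, area=2.0, f0=5.0, i_o=100.0,
    feedback=1.0, warming=1.0, c_ant=4.0, dt=3.0)
print(buffering, feedback_part, alpha_estimate)
```

Multiplying the estimate by C_ant recovers the heat uptake of equation (13), which is a convenient check on any implementation.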

The error in α̃(t) as estimated using equation (14), compared to the
diagnosed α(t), is due to the assumption of linearity and to the assumption
that the land does not absorb any carbon. In realistic scenarios, the land
is both a sink and a source of carbon, owing to differing land use. In 2009,
the cumulative land sink and sources could balance, although the
uncertainty on this number is large. The error arising from land-use
uncertainty is therefore a large component in the idealized 1%CO2 simulation,
but may have less importance in the real world. Similarly, the 1%CO2
simulation also does not include radiative forcing from other greenhouse
gases or aerosols. These effects are thought to be largely similar
in magnitude but opposite in sign, and therefore to a large extent cancel
out. Nonetheless, equation (14) shows that, owing to intrinsic
variables of the climate system, the relationship between H and C_ant is
linear to first order, explaining why the values of α are similar across
the ESM2M, MITgcm and CMIP5 models and observations.


MITgcm 
We compare the results from our ESM2M simulations to ocean-only
simulations using the MITgcm, in which the atmospheric forcing
remains constant. We do this, first, to demonstrate that our method
holds for various modelling set-ups and conditions. Second, the goal of
using anthropogenic carbon to estimate the fixed-circulation anthropogenic
heat content is to be able to apply this method to CMIP5 models
and observations. In the ESM2M simulations, the atmospheric circulation
and temperatures are different in the fixed-circulation and
free-circulation experiments. Using an ocean-only model with fixed
boundary conditions, we can show that the coupling of the ocean's heat
and carbon storage is independent of ocean-atmosphere coupling.
Our global MITgcm set-up uses a 1° × 1° horizontal resolution and
42 vertical levels with a realistic bathymetry. The model has a biological
module that resolves five tracers: DIC, ALK, PO4, dissolved
organic phosphorus (DOP) and O2. The model is forced at the surface
with NCEP reanalysis wind stress and freshwater flux, and the surface
temperature and salinity are restored towards the Levitus monthly
temperature and salinity climatology to avoid model drift. We use this
ocean-only model to perform free-circulation and fixed-circulation
climate change experiments that resemble those performed with the
ocean-atmosphere coupled GFDL ESM2M described in the text.

In testing the fixed-circulation experiments, the seasonality of the 
surface forcing alongside a prescribed yearly mean circulation field pro- 
duces strong atmosphere-ocean flux anomalies, leading to unphysical 
feedback. We therefore spin up the model both for the free-circulation 
and fixed-circulation experiments using October conditions without 
a seasonal cycle. October is chosen because it produces a mean-state 
circulation and air-sea carbon and heat fluxes most similar to the yearly 
climatology of the model that is forced with the full monthly varying 
forcing fields. The mean overturning and barotropic streamfunctions,
Ψ_MOC and Ψ_baro, respectively, are shown in Extended Data Fig. 9.

To simulate anthropogenic warming due to CO2 emissions, we perturb
the prescribed zonally uniform climatological temperature restoring
profile with a latitudinally varying temperature perturbation. The
idealized perturbation curve prescribes a smooth transition from a
warming of 0 °C at 90° S, to 1.5 °C at the equator, to 0.5 °C at 90° N. We
introduce anthropogenic carbon by raising the value of atmospheric
pCO2 from 280 ppm in the pre-industrial state to 560 ppm in the first
time step of the climate change experiment, after which time the carbon
cycle model evolves freely (therefore adding a pulse injection of
carbon). We perform this experiment with both fixed and free ocean
circulation.
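
The text specifies only the three anchor values of the perturbation curve (0 °C at 90° S, 1.5 °C at the equator, 0.5 °C at 90° N), not its functional form. The Python sketch below is one possible smooth curve through those points, using half-cosine interpolation in each hemisphere; this interpolation is an assumption made for illustration and is not necessarily the curve used in the experiments.

```python
import math


def warming_perturbation(lat_deg: float) -> float:
    """Idealized warming (deg C) at a latitude given in degrees north.

    Half-cosine interpolation between the anchor values is an assumed,
    illustrative choice; only the anchor values come from the text.
    """
    if not -90.0 <= lat_deg <= 90.0:
        raise ValueError("latitude must lie between -90 and 90 degrees")
    if lat_deg <= 0.0:
        start, end, frac = 0.0, 1.5, (lat_deg + 90.0) / 90.0
    else:
        start, end, frac = 1.5, 0.5, lat_deg / 90.0
    weight = 0.5 * (1.0 - math.cos(math.pi * frac))  # rises smoothly 0 -> 1
    return start + (end - start) * weight


profile = [warming_perturbation(lat) for lat in range(-90, 91, 30)]
print(profile)
```

The half-cosine weight has zero slope at both ends of each segment, so the two hemispheric pieces join without a kink at the equator.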

In the free experiment, the circulation is changing owing to the 
changes in surface temperature, whereas in the fixed experiment 
the circulation (the 3D Eulerian and bolus velocity fields) is fixed 
and set to its pre-industrial value. However, parameterizations such 
as convective and eddy mixing are allowed to act freely for both
the free-circulation and fixed-circulation experiments, as in ESM2M. 
To remove the effects of the drift in the fixed-circulation warming 
experiment, we run a tandem fixed-circulation pre-industrial control 
simulation. All anomalies are calculated as the difference between the 
perturbation experiment and the tandem control simulation. After 
50 yr (100 yr), there is a 3% (4%) difference in total ocean anthropogenic
carbon uptake between the fixed-circulation and free-circulation experiments,
and a 4% (6%) difference in total heat storage, which agrees well
with previous coupled climate simulations. The total ocean heat and
carbon uptake are therefore similar between the free-circulation and 
fixed-circulation experiments, although their spatial distributions 
are different. 
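
The drift correction described above amounts to subtracting the tandem control from the perturbed run year by year and then averaging over the analysis window. A minimal Python sketch follows, assuming yearly series that start at model year 1; the function name and the synthetic data are hypothetical.

```python
def mean_anomaly(perturbed, control, first_year=61, last_year=80):
    """Mean of (perturbed - control) over model years first_year..last_year.

    Both inputs are yearly series whose first element is model year 1.
    """
    diffs = [p - c for p, c in zip(perturbed, control)]
    window = diffs[first_year - 1:last_year]
    return sum(window) / len(window)


# Synthetic example: a shared drift of 0.01 per year plus a forced signal
years = range(1, 101)
control_run = [0.01 * y for y in years]
perturbed_run = [0.01 * y + 0.05 * y for y in years]
signal = mean_anomaly(perturbed_run, control_run)
print(signal)
```

Because the drift is common to both runs, it cancels in the difference and only the forced signal remains.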

The perturbed overturning and barotropic streamfunctions, Ψ_MOC
and Ψ_baro, respectively, are shown in Extended Data Fig. 9. The experiments
have been designed to optimally and rigorously test the presented
method. Therefore, our scenarios are idealized, inducing large
warming-induced circulation changes, instead of inducing realistic climate
change scenarios from historical forcing or projections. However,
our model simulations show very similar quantitative and qualitative
results to coupled climate models under climate change scenarios as
well as observational patterns of ocean warming and anthropogenic
carbon storage.

The results for fixed-circulation and free-circulation 0-2,000 m
column-integrated anthropogenic heat and carbon storage, as well
as for the carbon and heat redistribution, are shown in Extended Data
Fig. 10. Extended Data Fig. 10 also shows the fixed circulation and
redistributed heat content, as calculated using only free-circulation
quantities and equations (1) and (2). Similarly to fully coupled climate
model experiments, discrepancies in the patterns of anthropogenic
heat content between the free-circulation and fixed-circulation simulations
are apparent, whereas patterns of anthropogenic carbon
storage remain largely unaffected by changes in ocean currents that
arise from warming. After 70 yr, the anthropogenic heat storage in
the fixed experiment, H_ad, has increased in all regions, whereas in
the free experiment, the ocean heat storage H has decreased in the
North Atlantic and more heat is stored in the low latitudes in the South
Atlantic. The pattern of anthropogenic heat storage in the Atlantic is
due to the redistribution of heat by the modified circulation, as the
weakening of the Atlantic meridional overturning circulation can
lead to a reduction of heat transport from the South Atlantic to the
North Atlantic. The redistribution of anthropogenic carbon,
however, is much smaller.

The fixed heat that is calculated using equations (1) and (3) (Extended 
Data Fig. 10) agrees very well with the explicitly simulated fixed heat. 
The corresponding redistributed heat patterns are also similar to those 
calculated using the fixed-circulation experiment, demonstrating 
the usefulness of our method for models with different structure and 
forcing. 
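
The reconstruction tested here can be summarized in a few lines. The Python sketch below assumes the relations implied by the definitions in this section and in the figure captions: a coupling parameter α equal to the ratio of globally integrated heat to globally integrated anthropogenic carbon, added heat equal to α times C_ant, and redistributed heat as the remainder. Function and variable names are hypothetical, and grid cells are treated as equal in size for simplicity.

```python
def decompose_heat(total_heat, c_ant):
    """Split total heat storage into added and redistributed parts.

    total_heat and c_ant are sequences of column-integrated values on the
    same grid; equal cell areas are assumed (a simplification).
    """
    alpha = sum(total_heat) / sum(c_ant)          # global coupling parameter
    added = [alpha * c for c in c_ant]            # fixed-circulation heat
    redistributed = [h - a for h, a in zip(total_heat, added)]
    return alpha, added, redistributed


alpha, added, redistributed = decompose_heat([1.0, 2.0, 3.0], [1.0, 1.0, 1.0])
print(alpha, added, redistributed)
```

By construction the redistributed heat sums to zero globally, so it only moves heat between regions.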


Significance testing 

To test the statistical significance of the redistributed heat in the CMIP5
ensemble, we test at the 66% confidence level. The redistributed heat
is not significant where the mean signal is less than σ/√N, where σ is
the standard deviation of the signal and N is the number of CMIP5 models,
N = 11. The mean state has a zero mean and zero standard deviation
of redistributed heat.
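
The ensemble criterion can be written as a one-line comparison at each grid point. The Python sketch below is illustrative: the function name is hypothetical, and both the use of the population standard deviation and the comparison of the absolute value of the mean are assumptions, because the text does not specify either.

```python
import math
import statistics


def is_significant(model_values) -> bool:
    """True if the ensemble-mean magnitude is at least sigma / sqrt(N)."""
    n = len(model_values)
    mean = statistics.fmean(model_values)
    sigma = statistics.pstdev(model_values)  # population form is an assumption
    return abs(mean) >= sigma / math.sqrt(n)


agreeing = [1.0] * 11                    # all 11 models agree
cancelling = [-1.0, 1.0] * 5 + [0.0]     # signals cancel in the mean
print(is_significant(agreeing), is_significant(cancelling))
```

Grid points for which the function returns False correspond to the hatched regions in the figures.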

To test the significance of the redistributed heat in the observations,
we test at the 66% confidence level by calculating whether the signal
is larger than the 1σ confidence interval. The 1σ confidence interval
is determined by quadratically combining the 1σ uncertainty in the
observational estimates of heat and carbon storage.
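
Combining independent uncertainties in quadrature means taking the square root of the sum of their squares. The Python sketch below illustrates the test; the names are hypothetical, and the scaling of the carbon uncertainty by the coupling parameter to express it in heat units is an assumption that the text does not spell out.

```python
import math


def combined_sigma(sigma_heat, sigma_carbon, alpha=1.0):
    """1-sigma uncertainty from two independent errors added in quadrature.

    alpha converts the carbon uncertainty into heat units (an assumption).
    """
    return math.hypot(sigma_heat, alpha * sigma_carbon)


def observation_is_significant(signal, sigma_heat, sigma_carbon, alpha=1.0):
    """True if the signal magnitude exceeds the combined 1-sigma interval."""
    return abs(signal) > combined_sigma(sigma_heat, sigma_carbon, alpha)


print(combined_sigma(3.0, 4.0))
print(observation_is_significant(6.0, 3.0, 4.0))
```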


CMIP5 

The 11 CMIP5 models used in this study are the following: CMCC-CESM,
CNRM, IPSL-CM5A-LR, IPSL-CM5A-MR, IPSL-CM5B-LR, HadGEM2-ES,
HadGEM2-CC, MPI ESM MR, NorESM, GFDL ESM2M and GFDL ESM2G.
All of these models are Earth system models with fully coupled carbon
cycles. Heat and carbon anomalies are calculated with respect to a
tandem 1850 pre-industrial control simulation.


Data availability 


The code of the GFDL ESM2M model is available from
https://github.com/mom-ocean and the results from the free-circulation and
fixed-circulation 1%CO2 simulations are available from
https://doi.org/10.1594/PANGAEA.914424. CMIP5 model data are available from
the Earth System Grid Federation data portal at
https://esgf-node.llnl.gov/projects/cmip5/. The historical ocean temperature
reanalysis is available at http://159.226.119.60/cheng/. The estimate of
historical ocean anthropogenic carbon is available at
https://tdevries.eri.ucsb.edu/models-and-data-products/. The observed ocean
heat uptake is available at
https://laurezanna.github.io/publication/zanna-et-al-2017b/. The MITgcm code
is available at http://mitgcm.org/. The GLODAPv2 synthesis product is
available at https://www.nodc.noaa.gov/ocads/oceans/GLODAPv2/. The
topographical data used in Figs. 1, 3, 4 and Extended Data Figs. 1, 5, 8-10
are available in MATLAB and provided by NOAA. The CMIP5 and observational
estimates of total, added and redistributed heat are available at
https://laurezanna.github.io/publication/bronselaer-zanna-2020/.
Extended Data Fig. 2 | The heat-carbon coupling parameter, α.

a, α, obtained in ESM2M using α(t) = ℋ(t)/C_ant(t) (pink), and as estimated using
equation (14) for α̃ (grey line). The green line shows the magnitude of the first
term of the right-hand side of equation (14), and the blue line shows the
second term of the right-hand side of equation (14), which is related to the
temperature-feedback term. The grey shading shows the error in the estimate
that is due to the assumption of linearity and to various other processes and
feedback mechanisms ignored, such as land carbon uptake. The negative
values of α and α̃ at the start of the experiment are due to natural variability
when the external forcing is small (giving a low signal-to-noise ratio).

b, α(t) = ℋ(t)/C_ant(t), relative to the years 1870-1890 in CMIP5 historical/RCP8.5
simulations, with the mean given by the thick pink line and the range given by
the pale yellow area. Thin lines show individual models (ESM2M is shown in
green for reference) and dashed lines show the models that simulate ocean heat
loss in the mid-twentieth century that are excluded from the mean and range.
The blue solid line shows the observations (using ref. for heat) and its blue
shading shows the 66% confidence interval from observations.
Ocean redistributed heat H_ad(y, z, t) = α(t)C_ant(y, z, t) in terms of temperature
instead of heat. Changes are shown as the linear trends over the indicated
periods multiplied by the length of the period. a, b, The results from
observational estimates for the Atlantic basin (a) and the Pacific basin (b).
c, d, Changes in the CMIP5 historical RCP8.5 ensemble. The black horizontal
line indicates the 2,000 m depth level above which we show the observations.
The hatching indicates where the changes are not significant at the 66%
uncertainty level. For the observations, the uncertainty is from the estimates
of H and C_ant, whereas for the CMIP5 models the uncertainty is from the mean of
the ensemble due to inter-model spread.
Extended Data Fig. 6 | Latitudinal profiles of ocean carbon and CFCs.

a, ESM2M zonal-mean contribution of DIC (green), DIC+ALK (blue) and
DIC+ALK+T (red) towards the change in ocean surface pCO2. Changes in pCO2
(ΔpCO2) are roughly equal to (∂pCO2/∂DIC)ΔDIC. b, ESM2M zonal-mean mixed
layer change, relative to the pre-industrial control, in ΔDIC (green),
fixed-circulation added temperature ΔT_ad (red), and ΔCFC12 (purple). The blue
line shows the sensitivity term ∂pCO2/∂DIC. c, ESM2M zonally integrated C_ant
(green), CFC12 (purple) and H_ad (red) storage in the upper 2,000 m relative to
the pre-industrial control. Solid single lines show year 70 from the
fixed-circulation 1%CO2 and double lines show year 2060 of the ESM2M RCP8.5
simulation. d, ESM2M ∂pCO2/∂DIC with a latitudinally varying temperature
(solid) and a latitudinally constant temperature (dotted). e, Observed zonally
integrated C_ant (green) and CFC11 (purple) storage in the upper 2,000 m in
year 2017, from ref.*. The quantities in b-d have been scaled by the global
mean value for each tracer, and the blue line in b has been offset upwards by a
scaled value of 1 for clarity.
year 70 of the 1%CO2 simulation with IPSL-CM5A-LR (c), and year 2011 from
observations (using GLODAPv2 for mean state fields and ΔDIC from a previous estimate; d).
The thin lines in b and d show the 66% confidence interval. pCO2 values from
observations are calculated using CO2SYS. The correlation coefficients
between ΔDIC and ∂pCO2/∂DIC are -0.88 and -0.82 for IPSL-CM5A-LR and
observations, respectively.
Extended Data Fig. 9 | MITgcm ocean circulation. a, b, The mean-state (a) and
overturning (b) barotropic streamfunctions in the MITgcm control simulation,
where 1 Sv = 10⁶ m³ s⁻¹. c, d, The perturbed (c) and overturning (d) barotropic
streamfunctions in the MITgcm climate change simulation after 70 yr of
warming (taken as the mean of years 61-80).


Extended Data Fig. 10 | MITgcm simulated anthropogenic tracer changes. 
a-d, Upper 2,000 m column-integrated anomalies in the MITgcm climate
change experiments, for free-circulation carbon (a), free-circulation heat (b),
fixed-circulation carbon (c), and fixed-circulation heat (d). e, f, The
re-distributions of carbon (e) and heat (f) are taken as the difference between
the free-circulation and fixed-circulation anomalies. Anomalies are taken as
the difference between years 61-80 of the perturbed run and a tandem control.
g, h, The fixed-circulation heat estimated using only the free-circulation
experiment quantities (g), and the corresponding estimated redistributed
heat (h).
Tropical soils contain one-third of the carbon stored in soils globally, so
destabilization of soil organic matter caused by the warming predicted for tropical
regions this century could accelerate climate change by releasing additional carbon
dioxide (CO2) to the atmosphere. Theory predicts that warming should cause only
modest carbon loss from tropical soils relative to those at higher latitudes, but there
have been no warming experiments in tropical forests to test this. Here we show that
in situ experimental warming of a lowland tropical forest soil on Barro Colorado
Island, Panama, caused an unexpectedly large increase in soil CO2 emissions. Two
years of warming of the whole soil profile by four degrees Celsius increased CO2
emissions by 55 per cent compared to soils at ambient temperature. The additional
CO2 originated from heterotrophic rather than autotrophic sources, and equated to a
loss of 8.2 ± 4.2 (one standard error) tonnes of carbon per hectare per year from the
breakdown of soil organic matter. During this time, we detected no acclimation of
respiration rates, no thermal compensation or change in the temperature sensitivity
of enzyme activities, and no change in microbial carbon-use efficiency. These results
demonstrate that soil carbon in tropical forests is highly sensitive to warming,
creating a potentially substantial positive feedback to climate change.


Tropical forests have a large role in the global carbon cycle because they
exchange more CO2 with the atmosphere than any other ecosystem,
contain roughly two-thirds of terrestrial plant biomass and harbour
a substantial fraction of global soil carbon. Between 30% and 50% of
the carbon respired from tropical forests originates from soil, most of
which is derived from the decomposition of organic matter. Thus,
even a small increase in respiration from tropical forest soils could have
a large effect on atmospheric CO2 concentrations, with consequences
for global climate.

There is considerable concern that increased global temperatures
will destabilize soil carbon and increase the flux of CO2 from soil to
the atmosphere. Experiments in temperate and arctic regions have
consistently found that short-term warming causes a considerable
increase in soil CO2 efflux compared to soil at ambient temperature
(for example, a 37% increase over 2 years in temperate forest). For
the tropics, the response of soil carbon to warming is expected to be
smaller than at higher latitudes because kinetic theory predicts that
the intrinsic temperature sensitivity of reaction rates is reduced at
higher temperatures. Consistent with this, a meta-analysis of warming
experiments has shown that the temperature sensitivity of soil CO2
efflux increases with latitude. However, the extent to which intrinsic
temperature sensitivity translates into actual ('apparent') temperature
sensitivity depends on covariation of other environmental factors that
influence respiration, such as soil moisture and substrate availability.
As there have been no in situ warming experiments conducted
in tropical forests, the apparent temperature sensitivity of soil organic
matter in this biome remains unknown. As a result, earth system models
continue to use kinetic theory to define the temperature sensitivity of
soil carbon in the tropics, limiting how well they predict the response
of tropical forests to global environmental change.

Several factors could influence the apparent temperature sensitivity
of soil organic matter. For example, soil warming is typically accompanied
by soil drying, which can either reduce respiration in aerobic soils
by reducing water availability, or increase respiration in waterlogged
soils by increasing oxygen availability. Warming can also affect
respiration rates by inducing changes in biotic processes, such as the
physiology or community composition of microbes, or changes in
substrate availability to decomposers. In experiments performed at
higher latitudes, temperature-adaptive or compensatory responses
of microbial communities and enzyme activities have been shown to
modulate the effect of warming on the soil carbon cycle. In the tropics,
future new temperature maxima could exceed critical biochemical
thresholds, with added complexity emerging from altered interactions
among species-rich plant and microbial communities and
covarying changes in hydrological or nutrient cycles.

Here we present results from the first warming experiment in
a tropical forest (SWELTR: Soil Warming Experiment in Lowland
Tropical Rainforest). The experiment tests the response of the whole
soil profile to the 4 °C warming predicted for tropical latitudes by
the end of this century (Fig. 1a, Extended Data Figs. 1, 2). SWELTR
consists of five pairs of circular control and warmed plots distributed
evenly within approximately 1 ha of semideciduous moist lowland
tropical forest on Barro Colorado Island, Panama. The soils
are moderately weathered Dystric Eutrudepts (Inceptisols) that
Fig. 1 | Soil temperature and moisture content in control and warmed plots
by depth. a, Soil temperature. b, Volumetric soil moisture content. Data are
measurements from integrated soil temperature and moisture probes for the
two-year period after the warming treatment began (December 2016 to
December 2018). The vertical lines in a (dotted blue and dashed red) show the
mean soil temperature across the soil profile, which is 26.18 °C for control plots
and 30.14 °C for warmed plots (3.96 °C difference). The treatment effect was
P < 0.001 for all comparisons, based on the temporal variation across n = 5
plots. Box plots are standard Tukey plots, where the centre line represents the
median, the lower and upper hinges represent the first and third quartiles, and
whiskers represent 1.5 times the interquartile range. The treatments are offset on
the y axis (depth) for easier viewing.


have developed on the volcanic facies of the Bohio formation 
(Extended Data Table 1, Methods). Each warmed plot has a ground 
surface area of around 20 m’ heated to 1.2 m depth, resulting ina total 
volume of 120 m? of warmed soil across the experiment. 

Two years of experimental warming increased soil CO₂ emissions
by 55%, from 18.8 ± 1.9 megagrams of carbon per hectare per year
(Mg C ha⁻¹ yr⁻¹) in control plots to 29.2 ± 5.0 Mg C ha⁻¹ yr⁻¹ in warmed
plots (treatment effect, P < 0.05; Fig. 2, Extended Data Table 2, Extended
Data Fig. 3). The soil CO₂ emission rate from unheated (control) plots
is representative of tropical forests worldwide (8-40 Mg C ha⁻¹ yr⁻¹),
including in the Amazon basin (12-24 Mg C ha⁻¹ yr⁻¹). Using exclusion
and ingrowth cores to partition respiration from heterotrophic
(soil-derived) and autotrophic (root-derived) sources, we find that
the increase in CO₂ efflux was derived predominantly from heterotrophic
sources, whether from the decomposition of leaf-litter inputs or
from pre-existing soil organic matter (Figs. 2, 3, Extended Data Fig. 4).
Soil-derived respiration increased from 12.0 ± 2.1 Mg C ha⁻¹ yr⁻¹ in control
plots to 20.1 ± 4.2 Mg C ha⁻¹ yr⁻¹ in warmed plots (a 68% increase
of 8.2 Mg C ha⁻¹ yr⁻¹; treatment effect, P < 0.05), whereas root-derived
respiration was not altered significantly (P = 0.21; 6.8 ± 1.2 Mg C ha⁻¹ yr⁻¹
and 9.0 ± 3.4 Mg C ha⁻¹ yr⁻¹ in control and warmed plots, respectively;
Fig. 3, Extended Data Table 3).
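
The percentage changes quoted above can be checked from the rounded means given in the text. The following is a minimal sketch, not the authors' analysis; the variable and function names are illustrative. Because it uses rounded means, the soil-derived increase comes out as 8.1 Mg C ha⁻¹ yr⁻¹ (67.5%) rather than the reported 8.2 (68%), which were presumably computed from unrounded data.

```python
# Rounded annual means reported in the text (Mg C per hectare per year).
total = {"control": 18.8, "warmed": 29.2}
soil_derived = {"control": 12.0, "warmed": 20.1}
root_derived = {"control": 6.8, "warmed": 9.0}


def percent_increase(control, warmed):
    """Relative change of the warmed value with respect to the control, in percent."""
    return 100.0 * (warmed - control) / control


total_pct = percent_increase(total["control"], total["warmed"])
soil_pct = percent_increase(soil_derived["control"], soil_derived["warmed"])
root_pct = percent_increase(root_derived["control"], root_derived["warmed"])

# Share of the total increase that is attributable to soil-derived respiration.
soil_share = (soil_derived["warmed"] - soil_derived["control"]) / (
    total["warmed"] - total["control"]
)

print(f"total: +{total_pct:.1f}%  soil-derived: +{soil_pct:.1f}%  "
      f"root-derived: +{root_pct:.1f}%")
print(f"soil-derived share of the increase: {100 * soil_share:.0f}%")
```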

It is possible that the large warming-induced increase in soil CO₂
efflux was due in part to soil drying because the warmed plots were
slightly drier than the controls, particularly in the early wet season
(Extended Data Fig. 2). In very wet soils, soil drying can increase
respiration by increasing the supply of oxygen to microbes. By contrast, soil
drying under aerobic conditions can reduce heterotrophic respiration
by promoting water limitation of microbial metabolism. Here, we
find a marginally non-significant effect of soil moisture on CO₂ efflux
across seasons (Extended Data Table 2; annual, P = 0.69; wet-season,
P = 0.07; dry-season, P = 0.06), consistent with the parabolic relationship
of CO₂ efflux with soil moisture for this site. However, there was
no direct effect of warming on soil moisture (surface soils, P = 0.19;
whole-profile, P = 0.24; Extended Data Table 4) and the interaction
between soil moisture and warming in the CO₂ efflux model was not
significant (annually and for individual seasons; P > 0.2; Extended
Data Table 2), indicating that the warming effect on CO₂ efflux was not
influenced by soil moisture. Furthermore, soil moisture was not
correlated with soil CO₂ efflux in the warmed plots (Extended Data Fig. 3),
and drying during the early wet season in warmed soil (Extended Data
Fig. 2) should decrease rather than increase CO₂ efflux because the
soil was aerobic during this period and below the moisture content
of 0.45 m³ m⁻³ at which soil CO₂ efflux peaks in this forest. Our data
thus show that although soil moisture influenced soil CO₂ efflux and
warmed plots were slightly drier than control plots, particularly
during the early wet season, drying did not contribute significantly to
the increased CO₂ efflux from the warmed soil.

There was no moderation of the warming-induced increase in soil
CO₂ efflux over the two years of the experiment. Such a moderation
might be expected in the long term, through substrate limitation,
adaptation of microbial communities (through changes in microbial
carbon-use efficiency; CUE) or thermal compensation of enzyme
activities (reduced maximum potential activity (Vmax) at higher
temperatures). We found no reduction in extractable or mineralized
nitrogen or phosphorus with warming, as would be expected under
nutrient limitation (Extended Data Fig. 5). Almost all hydrolytic
enzymes were unaffected by warming (Extended Data Fig. 5), except
for β-xylanase (an enzyme involved in hemicellulose degradation),
for which activity increased with warming during the wet season,
the opposite of the response predicted by thermal compensation.
The temperature sensitivity of enzyme activity (Q₁₀ of Vmax) was
unaffected by warming (Extended Data Fig. 5a), indicating no dampening
effect on soil carbon breakdown via thermal adaptation of enzyme
production as expected at warmer temperatures. Microbial CUE, which
broadly represents carbon stabilized in biomass relative to carbon
lost in respiration and can influence long-term (decadal) soil carbon
loss, was unaffected by warming (treatment effect, P = 0.37; Extended
Data Fig. 5). Microbial biomass carbon, however, increased with warming
at the annual scale (treatment effect, annual scale, P = 0.02), with
a marginally non-significant increase at the seasonal scale (P < 0.1;
Extended Data Fig. 5). Together, these results suggest slightly increased
growth in response to greater organic-matter turnover in the absence
of nutrient constraints to carbon degradation, which did not translate
into changes in CUE.

Our finding that tropical forest soil carbon has a high apparent
temperature sensitivity challenges the prevailing expectation that
the temperature sensitivity of soil carbon is lower in the tropics
compared to cooler ecosystems at higher latitudes. The 55% increase in
total soil CO₂ emissions we report here (from 18.8 to 29.2 Mg C ha⁻¹ yr⁻¹)
following two years of 4 °C whole-soil-profile warming is larger than that
found in a temperate forest using a similar whole-profile experimental
design (34%-37% increase over two years of 4 °C whole-profile warming,
from 13.0 to 17.5 Mg C ha⁻¹ yr⁻¹). Furthermore, the rate of additional
soil carbon loss (8.2 Mg C ha⁻¹ yr⁻¹) is greater than for all the studies
in a recent meta-analysis of surface-only soil warming experiments
at higher-latitude sites (all loss rates less than 5 Mg C ha⁻¹ yr⁻¹). The
expectation that the temperature sensitivity of soil carbon breakdown
is lower in the tropics compared to higher latitudes, based in part on
kinetic theory and commonly used to describe soil carbon responses
in earth system models, is therefore not consistent with the
(‘apparent’) temperature sensitivity of the breakdown of tropical forest soil
carbon reported here. However, our results are consistent with recent
atmospheric and satellite measurements, which have shown a high
sensitivity of ecosystem-scale carbon cycling in tropical regions in
response to interannual temperature variation. Our findings suggest
that tropical soils contribute substantially to these ecosystem-scale
responses to warming.

This high apparent temperature sensitivity of tropical forest soil
carbon under in situ experimental warming must arise through the
temperature response of covarying ecosystem properties rather than as
the sole consequence of intrinsic kinetic processes. Although our data
do not provide conclusive evidence for the mechanism(s) underlying
this marked increase in soil carbon loss from warmed soil, several
findings point to a possible explanation: (i) the general absence of thermal
compensation in enzyme activities (no decrease in Vmax); (ii) the lack
of change in the temperature sensitivity of enzymes under warming
(no decrease in Q₁₀ of Vmax); and (iii) the lack of a moderating thermal
response of microbial CUE (Extended Data Fig. 5). Together, our
results indicate that organic-matter degradation increased under
warming with no moderating responses or acclimation among microbial
communities or the enzymes they synthesize.

Fig. 2 | Soil CO₂ efflux from control and warmed soils
over two years. a, Total soil CO₂ flux during the study
period (2017-2019), relative to the beginning of the
warming treatment (relative day 0). b, Difference in
carbon emission (warmed minus control) partitioned into
soil-derived and root-derived components. Measurements
were made every two weeks. Points in a represent the mean
value of five plots, with error bars representing one
standard error of the spatial variation (n = 5 plots). The box
plots represent the temporal variation over sequential
100-day periods to show seasonal dynamics. Box plots are
standard Tukey plots, where the centre line represents
the median, the lower and upper hinges represent the
first and third quartiles, and whiskers represent ± 1.5 times the
interquartile range. The dotted vertical line shows when
installation and testing of warming plots began (during
this period each plot was warmed by 4 °C relative to
controls for a period of 1-2 weeks); the dashed vertical line
(relative day 0; 1 November 2017) shows when all five
warming plots were switched on permanently. The yellow
shaded areas represent dry seasons (1 January to 1 April).
Soil CO₂ efflux was significantly higher in warmed plots for
annual data and for dry or wet seasons individually
(Extended Data Table 2).

Fig. 3 | The annual carbon emission partitioned into soil-derived and
root-derived components. a, The pre-treatment period (January-May 2016;
predominantly dry season). b, The two years with warming. The figures show
total CO₂ and the root-derived and soil-derived components calculated using
CO₂ efflux from partition cores (equations (1) and (2) in Methods). Differences
between control and warmed plots are shown by asterisks where P < 0.05 or as
non-significant (NS); determined using mixed effect models (Extended Data
Tables 2, 3). Box plots are standard Tukey plots, where the centre line
represents the median, the lower and upper hinges represent the first and third
quartiles, and whiskers represent ± 1.5 times the interquartile range (for n = 5 plots).

This surprisingly large loss of soil carbon from warmed soil
represents a substantial positive climate feedback over the period of
this study. The additional carbon loss from warming observed here
is of similar magnitude to annual carbon input from litterfall at this
site (5-7 Mg C ha⁻¹ yr⁻¹) and is equivalent to approximately 13% of
the total soil carbon stock, or 30% of gross primary productivity
(27.5 Mg C ha⁻¹ yr⁻¹). Extrapolation of the first two years of carbon
loss in our experiment across the entire tropical forest soil carbon stock
(502 Pg C) indicates a global loss of more than 65 Pg C with 4 °C warming
this century, which is in broad agreement with estimated carbon loss
based on a five-year soil translocation experiment in tropical forests
elsewhere. In light of these findings, earlier estimates of global soil
carbon loss under 4 °C warming (110-190 Pg C), which were based
on experiments performed at higher latitudes, underestimate the
magnitude of this global earth-atmosphere feedback.
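
The two proportional statements in this paragraph can be reproduced with simple arithmetic. The sketch below assumes, as the text does, that the roughly 13% fractional loss measured at this site applies uniformly to the whole tropical forest soil carbon stock; all variable names are illustrative.

```python
# Figures quoted in the text.
additional_loss = 8.2        # Mg C ha^-1 yr^-1, extra soil-derived carbon loss
gross_primary_prod = 27.5    # Mg C ha^-1 yr^-1, gross primary productivity at the site
fraction_of_stock = 0.13     # approximate share of the soil carbon stock lost
tropical_soil_stock = 502.0  # Pg C, tropical forest soil carbon stock

# Additional loss expressed as a share of gross primary productivity.
share_of_gpp = additional_loss / gross_primary_prod

# Uniform extrapolation of the fractional loss to the tropical stock.
# This is the simplifying assumption behind the ">65 Pg C" figure.
extrapolated_loss = fraction_of_stock * tropical_soil_stock

print(f"share of GPP: {100 * share_of_gpp:.0f}%")
print(f"extrapolated loss: {extrapolated_loss:.1f} Pg C")
```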

We expect that the rate of soil carbon loss will eventually decline in 
warmed soils as substrate limitation increases, but we do not know 
how long this will take, nor whether the long-term soil carbon balance 
will be affected by plant-soil interactions or changes in soil microbial 
communities as they adapt to warmer temperatures. The nature
of these longer-term responses will determine the strength of this
positive earth-atmosphere feedback, which is already substantial in
the short term, in contributing to further climate warming.

Methods


Site and experiment 

The experiment is situated in approximately 1 ha of seasonally moist
lowland tropical forest on Barro Colorado Island, Panama. Within the
plot area, the dominant tree species include Anacardium excelsum
and Poulsenia armata. The soils are Inceptisols (fine, isohyperthermic,
Dystric Eutrudepts) that are rich in clay (about 54% profile-weighted
clay concentration) and secondary metal oxides. They are developed
on the volcanic facies of the Bohio formation, a basaltic conglomerate
of Oligocene age. Although these moderately weathered Inceptisols
are less infertile than soils under larger areas of lowland tropical
forests, Inceptisols still account for 14% of total land area in the tropics
(Ultisols and Oxisols account for 20% and 23%, respectively), and soil
respiration in the control plots in our experiment is comparable to
that in lowland tropical forests in general, including those on Ultisols
and Oxisols.

The SWELTR experiment consists of ten circular plots (five paired
‘warm’ and ‘control’ plots). Each plot measures 5 m in diameter, with
approximately 10 m between each pair of control and warmed plots and
a minimum of 20 m between different plot pairs. Each warmed plot
contains two heating structures, each consisting of eight 1-m-long
stainless-steel rods, connected by approximately 50 cm of flexible
stainless-steel conduit. We used stainless-steel T-junctions at the top
of each rod (adjoining the flexible conduit) and conical caps at the
bottom of each rod. The final structure was 1.2 m tall. Inside each of
the structures, we threaded 25 m of heating cable (SLMCAB10120BF,
Briskheat) and filled the remaining space in the rods and conduit with
quartz sand, selected for its high thermal conductivity. The complete
structure was welded to seal the heating cable and sand inside. Two of
these structures were buried around a 3.5-m-diameter circumference,
with the top of the flexible conduit 5 cm belowground. Thus, each plot
contained 50 m of heating cable inserted to 1.2 m depth, encircling a
3.5-m-diameter area, with an effective heated plot area of 5 m diameter.
The experiment heats approximately 120 m³ of soil in total (five plots,
5 m diameter, 1.2 m depth). The plot design and heating methodology
follow previous studies.
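
The plot dimensions above determine the heated area and volume. The sketch below is a simple geometric check that treats each heated plot as a cylinder of 5 m diameter and 1.2 m depth, which is an idealization; the variable names are illustrative.

```python
import math

# Plot dimensions stated in the text.
plot_diameter_m = 5.0
heated_depth_m = 1.2
n_warmed_plots = 5
cable_per_structure_m = 25.0
structures_per_plot = 2

# Idealized cylinder geometry for one warmed plot.
plot_area_m2 = math.pi * (plot_diameter_m / 2) ** 2
plot_volume_m3 = plot_area_m2 * heated_depth_m

# Totals across the experiment.
total_volume_m3 = n_warmed_plots * plot_volume_m3
cable_per_plot_m = structures_per_plot * cable_per_structure_m

print(f"area per plot: {plot_area_m2:.1f} m^2")
print(f"total heated volume: {total_volume_m3:.0f} m^3")
print(f"heating cable per plot: {cable_per_plot_m:.0f} m")
```

The result, about 19.6 m² per plot and about 118 m³ in total, matches the rounded figures of around 20 m² and approximately 120 m³ given in the text.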


Experiment power supply 

The experiment was powered using a 120-V supply, delivered from a
mains outlet situated approximately 300 m from the experimental site.
The power cables from the mains outlet to each of the five experiment
control units were placed inside 19-mm-diameter liquid-tight flexible
steel conduit and buried in roughly 20-cm-deep trenches. For each of
the five control units (situated next to each warm-control plot pair), we
installed a 20-A circuit breaker and a 12-V outlet, supplied by a battery
connected to a 12-V battery maintainer. The 120-V outlets provided
power to the heating cables, gas analyser system and battery
maintainers; the 12-V outlet provided power to the dataloggers, relays and
sensors. Maximum power consumption is 2,500 W (500 W per plot),
although average consumption is roughly 500-1,000 W.


Temperature control 

Each warming and control plot was connected to a thermostat sys- 
tem, which maintained soil temperature in the warmed plots at 4 °C 
above ambient temperature. The thermostat system consisted of three 
integrated temperature and moisture sensors per plot (CS655 Reflec- 
tometer, Campbell Scientific; 20 cm in length), inserted to depths of 
0-20 cm, 50-70 cm and 100-120 cm at the mid-radius point in each 
plot, which were connected to a control unit (one control unit for each 
plot pair; five in total). The CS655 sensors continuously measured soil 
temperature to control the thermostat and logged the soil tempera- 
ture and volumetric soil moisture every hour. These whole-profile 
measurements were complemented by independent measurements 
of surface (0-20 cm) soil temperature and moisture (see below). 


The control units consisted of waterproof (IP68) enclosures containing
a solid-state power controller (DA10-24C0-0000, Watlow), relay (12-V
single channel), datalogger (CR1000, Campbell Scientific), and 12-V and
120-V power supply. Temperature in each warmed plot was therefore
maintained at 4 °C above the temperature in each corresponding paired
control plot, based on the average temperature from 0 cm to 120 cm
depth at the mid-radius point in each plot. The average temperature
difference over two years was 3.96 °C, which corresponds to the
average of 2.7 °C at 0-20 cm depth, 4.0 °C at 50-70 cm depth and 5.2 °C at
100-120 cm depth.
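
The control rule described above compares profile-averaged temperatures in paired plots. The sketch below illustrates that idea with a simple on/off rule; the actual switching logic of the datalogger and power controller is not given in the text, so `heater_should_be_on` and its behaviour are assumptions made for illustration only.

```python
from statistics import mean

TARGET_OFFSET_C = 4.0  # warming target stated in the text


def profile_mean(temps_by_depth):
    """Average temperature over the three sensor depths of one plot."""
    return mean(temps_by_depth)


def heater_should_be_on(control_temps, warmed_temps, target=TARGET_OFFSET_C):
    """Hypothetical on/off rule: heat while the warmed plot is below target offset."""
    offset = profile_mean(warmed_temps) - profile_mean(control_temps)
    return offset < target


# Depth-specific warming reported in the text (0-20, 50-70 and 100-120 cm).
reported_offsets = [2.7, 4.0, 5.2]
mean_offset = profile_mean(reported_offsets)
print(f"profile-mean warming from rounded values: {mean_offset:.2f} °C")

# Illustrative sensor readings (hypothetical values, not data from the study).
print(heater_should_be_on([26.0, 26.0, 26.0], [28.7, 30.0, 31.2]))
print(heater_should_be_on([26.0, 26.0, 26.0], [30.5, 30.0, 30.0]))
```

The mean of the three rounded depth values is about 3.97 °C, which agrees with the reported 3.96 °C to within rounding.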

This experimental design has been shown to warm the soil
approximately uniformly across the soil volume, with minor anomalies of
warmer soil very close to the heating rods (within 10 cm) and slightly
cooler surface soils as a result of heat transfer to the air. Therefore,
surface soils were slightly cooler compared to subsoils, although the
response of surface soils (rather than subsoils) will probably dominate 
the warming response across the soil profile because they contain 
greater organic matter (two-thirds of the carbon stock occurs in the 
upper 50 cm of the soil profile; Extended Data Table 1). The heating 
structures were installed during May-July 2016, and plots were tested 
during June-October 2016. The testing phase consisted of heating 
each plot by 4 °C for approximately 2-week periods. The experiment 
was switched on in full on 1 November 2016. 


Soil gas-exchange and partitioning 

Soil CO₂ efflux was measured every two weeks from 2016 until 2019 at
four systematically distributed locations within each plot using an
infrared gas analyser (IRGA Li-8100; LI-COR Biosciences). The measurement
frequency was increased to every week during seasonal transitions. The
soil collars for CO₂ efflux measurements were assigned to zones within
each plot (‘centre 1’, ‘centre 2’, ‘side 1’ and ‘side 2’) and were relocated
randomly within each zone every three months, for long-term within-plot
spatial independence. Soil CO₂ efflux was also measured every two
weeks for four root-partition cores per plot (two root-exclusion and
two root-ingrowth) to determine soil- and root-derived components
of the CO₂ efflux. At the same time as soil CO₂ efflux measurements,
we measured soil temperature (using a HI98509 thermometer probe;
Hanna Instruments) and soil moisture (using a Thetaprobe; Delta-T)
at 0-20 cm soil depth for a random location within a 1 m radius of each
soil collar, or within the root-partition cores.

Root-exclusion cores were made from PVC tubing (30 cm 
height, 10 cm diameter) with a 1-m nylon mesh base for drainage. 
Root-ingrowth cores (disturbance controls) had additional windows 
(around 340 cm²) covered with 2-mm mesh around the sides. In each
plot, two root-exclusion cores and two root-ingrowth cores were buried
approximately 30 cm from the heating cable (where
the soil profile is warmed on average by 4 °C).

The percentage contributions of fine roots and rhizo-microorganisms
(root-derived) and free-living heterotrophic microorganisms
(soil-derived) to the total soil CO₂ efflux were calculated as follows:

$$\text{soil-derived CO}_2\text{ efflux (\%)} = \frac{\text{root-exclusion core CO}_2\text{ efflux}}{\text{root-ingrowth core CO}_2\text{ efflux}} \times 100 \qquad (1)$$

$$\text{root-derived CO}_2\text{ efflux (\%)} = 100 - \text{soil-derived CO}_2\text{ efflux (\%)} \qquad (2)$$


Total soil CO₂ efflux measured for soil collars was multiplied by the
results from equations (1) and (2) to estimate the absolute
contributions of root-derived (roots, rhizo-microbial and mycorrhizal) and
soil-derived (free-living microbial through the decomposition of litter
and soil organic matter) components. The partition cores were buried
to 0-25 cm depth, where 95% of fine roots occur. Therefore, the soil
component consists of soil-derived CO₂ from the entire soil profile
(with a minor contribution from fine roots at greater than 20 cm depth).
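
The partitioning in equations (1) and (2) can be sketched as a small function. This is an illustrative implementation rather than the authors' code; the function name, argument names and the example flux values are hypothetical, and any consistent flux unit can be used.

```python
def partition_efflux(total_efflux, exclusion_core_efflux, ingrowth_core_efflux):
    """Split total soil CO2 efflux into soil- and root-derived parts.

    Follows equations (1) and (2): the soil-derived percentage is the ratio of
    root-exclusion to root-ingrowth core efflux, and the root-derived
    percentage is the remainder.
    """
    if ingrowth_core_efflux <= 0:
        raise ValueError("root-ingrowth core efflux must be positive")

    soil_pct = exclusion_core_efflux / ingrowth_core_efflux * 100.0  # equation (1)
    root_pct = 100.0 - soil_pct                                      # equation (2)

    # Absolute contributions: collar efflux scaled by each percentage.
    return {
        "soil_pct": soil_pct,
        "root_pct": root_pct,
        "soil_efflux": total_efflux * soil_pct / 100.0,
        "root_efflux": total_efflux * root_pct / 100.0,
    }


# Hypothetical example values, not measurements from the study.
result = partition_efflux(total_efflux=5.0,
                          exclusion_core_efflux=3.0,
                          ingrowth_core_efflux=4.0)
print(result)
```

With these example values the soil-derived share is 75% and the root-derived share is 25%, and the two absolute components sum to the total efflux.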


Soil properties 

Soil was sampled before the experimental treatments began (to 100 cm
depth; Extended Data Table 1) and then every three months after the
beginning of the experiment (0-10 cm; average responses in Extended
Data Fig. 5), at a point within the plots where the surface soil is heated
evenly (at approximately 30 cm from the heating structure). These
samples were then analysed using standard procedures to determine
the soil properties: total elements, available nutrients, microbial
carbon, nitrogen and phosphorus, and enzyme activities (see below). We
calculated microbial CUE from microbial carbon, nitrogen and
phosphorus and enzyme activity data using a stoichiometric method. Here
we describe the responses following two years of warming, by using
the average change in soil properties over two years (average of eight
temporal measurements per plot, with n = 5 per plot).

Total carbon and nitrogen were determined simultaneously by
automated combustion and gas chromatography using a Thermo Flash 1112
elemental analyser (CE Elantech). Total phosphorus was determined
by ignition (550 °C, 1 h) and extraction in 1 M H₂SO₄ (16 h; 1:50 soil to
solution ratio) with phosphate detection by automated molybdate
colorimetry. Exchangeable cations were determined by extraction in
0.1 M BaCl₂ and detection by inductively coupled plasma-optical
emission spectrometry with an Optima 7300 DV (Perkin-Elmer). Effective
cation exchange capacity (ECEC) was calculated as the sum of the charge
equivalents of Al, Ca, Fe, K, Mg, Mn and Na. Soil pH was determined in
deionized water in a 1:2 soil to solution ratio.

Extractable carbon and nitrogen were determined by fresh soil
extraction in 0.5 M K₂SO₄. Soil microbial biomass carbon and nitrogen
were measured by fumigation-extraction, using ethanol-free
chloroform as the fumigant followed by extraction with K₂SO₄. Extracts of
fumigated and unfumigated soil were analysed for extractable organic
carbon and nitrogen using a TOC-VCHN analyser (Shimadzu). Microbial
carbon and nitrogen were calculated as the difference in the respective
nutrient between fumigated and unfumigated extracts and corrected
for unrecovered biomass using a k factor of 0.45. Microbial biomass
phosphorus was determined by hexanol fumigation and extraction
with anion-exchange membranes. Phosphate was recovered from
anion-exchange membranes by shaking for 1 h in 50 ml of 0.25 M H₂SO₄,
with detection in the acid solution by automated molybdate
colorimetry using a Lachat Quikchem 8500 (Hach). Extractable phosphorus
was determined on unfumigated samples and microbial phosphorus
was calculated as the difference between the fumigated and
unfumigated samples, with correction for unrecovered biomass using a kp
factor of 0.4.

Soil enzyme activity (Vmax) was determined for seven enzymes
involved in carbon and nutrient cycling. We used microplate
fluorimetric assays with 100 μM methylumbelliferone (MU)-linked substrates
to measure the activity of β-glucosidase (degradation of β bonds
in glucose), cellobiohydrolase (degradation of cellulose), N-acetyl
β-glucosaminidase (degradation of N-glycosidic bonds),
phosphomonoesterase (degradation of monoester-linked simple organic
phosphates), sulfatase (degradation of sulfate esters) and β-xylanase
(degradation of hemicellulose). For each soil sample, five replicate
microplates were prepared and incubated at 2 °C, 10 °C, 22 °C, 30 °C
or 40 °C to calculate the temperature sensitivity (Q₁₀) of Vmax and
determine Vmax at control (26 °C) and warmed (30 °C) soil temperatures.
Further information on protocols for enzyme analyses is provided
elsewhere. All analyses apart from total elements (carbon,
nitrogen, phosphorus), cations and pH were determined on fresh soils
within 24 h of sampling, and K₂SO₄ extracts within 6 h, to avoid the
rapid changes that can occur during storage or pretreatment. All soil
chemical properties are expressed on the basis of oven-dry equivalent
soil (determined by drying at 105 °C for 24 h).
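
One common way to obtain Q₁₀ from activities measured at several incubation temperatures is a log-linear fit, in which Q₁₀ = exp(10 × slope) for the regression of ln(Vmax) on temperature. The text does not state which fitting procedure was used, so the sketch below is an assumption, and the activity values are synthetic (generated with a known Q₁₀ of 2) rather than data from the study.

```python
import math


def fit_q10(temps_c, vmax):
    """Least-squares fit of ln(Vmax) against temperature; returns Q10, slope, intercept."""
    logs = [math.log(v) for v in vmax]
    n = len(temps_c)
    t_mean = sum(temps_c) / n
    l_mean = sum(logs) / n
    sxy = sum((t - t_mean) * (l - l_mean) for t, l in zip(temps_c, logs))
    sxx = sum((t - t_mean) ** 2 for t in temps_c)
    slope = sxy / sxx
    intercept = l_mean - slope * t_mean
    return math.exp(10.0 * slope), slope, intercept


def vmax_at(temp_c, slope, intercept):
    """Predicted Vmax at a given temperature from the fitted log-linear model."""
    return math.exp(intercept + slope * temp_c)


# Incubation temperatures from the text; activities are synthetic with Q10 = 2.
incubation_temps = [2.0, 10.0, 22.0, 30.0, 40.0]
synthetic_vmax = [10.0 * 2.0 ** ((t - 20.0) / 10.0) for t in incubation_temps]

q10, slope, intercept = fit_q10(incubation_temps, synthetic_vmax)

# Ratio of predicted activity at warmed (30 °C) and control (26 °C) temperatures.
ratio_30_to_26 = vmax_at(30.0, slope, intercept) / vmax_at(26.0, slope, intercept)
print(f"Q10 = {q10:.2f}, Vmax(30 °C) / Vmax(26 °C) = {ratio_30_to_26:.2f}")
```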


Determination of carbon and nutrient use efficiencies 

We estimated carbon, nitrogen and phosphorus use efficiencies (CUE,
NUE and PUE) from ecological stoichiometry, whereby the CUE, NUE
or PUE of an organism is a function of the difference between its
elemental requirements for growth (carbon, nitrogen or phosphorus
in biomass and enzymatic investment for acquisition) and the
abundance of environmental substrate (carbon, nitrogen or phosphorus
in soil organic matter). This approach assumes that enzyme activities
scale with microbial production and organic-matter concentration,
and that microbial communities exhibit optimum resource allocation
with respect to enzyme expression and environmental resources.
These assumptions are empirically supported by Michaelis-Menten
kinetics and metabolic control analysis. On the basis of this
underlying assumption, CUE is calculated as

$$\mathrm{CUE}_{C:X} = \mathrm{CUE}_{\max}\,\frac{S_{C:X}}{S_{C:X} + K_X},$$

$$S_{C:X} = \frac{B_{C:X}/L_{C:X}}{\mathrm{EEA}_{C:X}},$$

with X denoting nitrogen (N) or phosphorus (P). Here $S_{C:X}$ represents
the extent to which the allocation of enzyme
activities offsets the disparity between the elemental composition of
available resources and the composition of microbial biomass. $K_X = 0.5$
and $\mathrm{CUE}_{\max} = 0.6$ are the half-saturation constant and the upper limit
for microbial growth efficiency based on thermodynamic constraints,
respectively. $\mathrm{EEA}_{C:X}$ is the extracellular enzyme activity (nmol g⁻¹ h⁻¹):
$\mathrm{EEA}_{C:N}$ was calculated as BG/NAG, where BG = β-glucosidase and
NAG = N-acetyl β-glucosaminidase, and $\mathrm{EEA}_{C:P}$ was calculated as BG/P,
where P = phosphomonoesterase. Molar ratios of soil organic carbon
to total nitrogen to total phosphorus were used as estimates of
$L_{C:X}$. Microbial biomass ratios $B_{C:X}$ were also calculated as molar ratios.
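
The stoichiometric CUE calculation above can be sketched as follows. The constants are those given in the text; the function names and the example input ratios are hypothetical and chosen only to make the arithmetic easy to follow.

```python
CUE_MAX = 0.6  # upper limit for microbial growth efficiency (from the text)
K_X = 0.5      # half-saturation constant (from the text)


def scalar_s(biomass_ratio, substrate_ratio, enzyme_ratio):
    """S_C:X = (B_C:X / L_C:X) / EEA_C:X."""
    return (biomass_ratio / substrate_ratio) / enzyme_ratio


def carbon_use_efficiency(biomass_ratio, substrate_ratio, enzyme_ratio):
    """CUE_C:X = CUE_max * S_C:X / (S_C:X + K_X)."""
    s = scalar_s(biomass_ratio, substrate_ratio, enzyme_ratio)
    return CUE_MAX * s / (s + K_X)


# Hypothetical inputs: microbial biomass C:N = 8, soil organic matter C:N = 12,
# and a BG/NAG activity ratio of 2. These are not values from the study.
cue_cn = carbon_use_efficiency(biomass_ratio=8.0,
                               substrate_ratio=12.0,
                               enzyme_ratio=2.0)
print(f"CUE (C:N) = {cue_cn:.2f}")
```

Because of the saturating form, CUE approaches but never exceeds the 0.6 ceiling as $S_{C:X}$ grows.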


Statistical analyses 

Treatment (warming) effects on time-averaged total and partitioned
CO₂ emissions and other soil properties (nutrients and microbial
properties) were tested using ANOVA. Treatment effects on soil CO₂
emissions were primarily tested using mixed-effects models with CO₂
efflux as the response variable, warming treatment, soil moisture,
season, warming × soil moisture and season × soil moisture as fixed effects,
and plot number as a random effect. The four soil collars within each
plot were relocated every three months, but were always systematically
assigned to zones within the plot (centre 1, centre 2, side 1 and side 2),
to avoid long-term disturbance of the soil due to the presence of the
collar and to ensure that the within-plot soil CO₂ efflux measurements
were spatially independent in the long term. Because the collar
locations changed, we did not include time as a random effect (repeated
measures) in the primary model. To test whether this
pattern of measurement frequency and soil-collar movement
influenced our results, we also ran models using a repeated-measures
approach, with both space (soil-collar zonal location, nested in plot)
and time as random effects. This repeated-measures analysis
reinforced the findings from the primary model analysis (Extended Data
Table 2). To test the response of partitioned soil- and root-derived CO₂
emissions to warming, we used repeated-measures mixed models
with plot number and time as random effects, because
the partition cores were fixed.

Treatment effects on soil-surface moisture content (0-20 cm depth)
were tested using mixed-effects models with volumetric soil
moisture as the response variable, warming treatment, season and
warming × season as fixed effects, and plot number as a random effect. In
this case, we did not use a repeated-measures analysis because each
within-plot soil-moisture sampling point was wholly independent.
Treatment effects on whole-soil-profile moisture, which were repeated
measures, were tested using a repeated-measures analysis with space
(depth nested in plot) and time as random effects. All statistical analyses
were performed in R (version 3.5.2).


Data availability 


The source data for this study (soil gas exchange, soil and
microbial properties) are available at
https://doi.org/10.6084/m9.figshare.12144768. Source data are provided with this paper.

Extended Data Fig. 1 | Thermal images of a warmed plot. a-d, Pictures (a, b)
and thermal images (c, d) of a warmed plot. The thermal images show the
soil-surface temperature 1 h after the warming structure was switched on (c)
and after a period of thermal equilibration of soil (d). The circular heating
structure was 3.5 m in diameter and extended to 1.2 m depth, which resulted in
an effective heated plot of approximately 5 m diameter and 1.2 m depth. The
experiment consisted of five warmed and control plot pairs in total. Image
credit: J. Bujan and E.V.

and warmed plots. a, b, Soil volumetric moisture content (a) and soil
temperature (b), for the period after the warming treatment began (December
2016 to December 2018), partitioned by soil depth (columns) and season
(rows). Box plots are standard (Tukey) plots, where the centre line represents
the median across the five plots over the study period, the lower and upper
hinges represent the first and third quartiles, and whiskers represent ± 1.5 times the
interquartile range. c, d, Temporal patterns in soil volumetric moisture
before this during the testing phase). The points are daily means of soil profile
and the error bars represent one standard error of the variation by plot (n = 5).
The box (Tukey) plots show the median temporal value over sequential 100-day
periods. The shaded areas represent dry seasons (1 January to 1 April).
Treatment effects on annual or seasonal soil-profile moisture content were not
significant; treatment effects on soil temperature were significant
(repeated-measures ANOVA, P < 0.001; Extended Data Table 4).

Extended Data Fig. 3 | Relationship between soil CO₂ efflux, soil moisture
and season, in control and warmed plots. a, b, Relationship between soil CO₂
efflux and soil moisture in warmed (red) and control (blue) plots during the
dry (a) and wet (b) seasons. Data were fitted (solid lines) using a quadratic
function. c, d, Soil CO₂ efflux differences between warmed (red) and control
(blue) plots during the dry (c) and wet (d) seasons. Soil CO₂ efflux in warmed
plots was significantly higher than controls for both dry and wet season,
although the difference was greater for the wet season (average difference of
2.8 μmol CO₂ m⁻² s⁻¹ for the wet season compared to 2.1 μmol CO₂ m⁻² s⁻¹ for
the dry season).

Extended Data Fig. 4 | Contribution of root-derived and soil-derived sources
to total CO₂ efflux. a, b, Data for root-derived (a) and soil-derived (b) sources are
for the study period. The error bars for points represent one standard error of the
variation across the five plots. The box (Tukey) plots show the median temporal
value over 100-day periods. The dotted vertical line is when installation and
testing of warming plots began; the dashed vertical line shows when all five
warming plots were switched on permanently. The shaded areas represent dry
seasons (1 January to 1 April).

Extended Data Fig. 5 | Average response of soil properties to warming.
a-c, Data are for the study period December 2016 to December 2018. Data are
partitioned into annual (a), dry-season (b) and wet-season (c) responses. Soil
properties are microbial CUE (ratio, no units), soil-extractable nutrients (NH₄, NO₃,
resin P; mg kg⁻¹), dissolved organic carbon (DOC; mg kg⁻¹), microbial elements (mic
C, mic N, mic P; mg kg⁻¹) and activities (Vmax) of extracellular enzymes β-glucosidase
(BG-ase), phosphomonoesterase (P-ase), N-acetyl β-glucosaminidase (N-ase) and
β-xylanase (Xy-ase) (nmol MU g⁻¹ min⁻¹). The temperature sensitivity of activities of
extracellular enzymes (Q₁₀ of Vmax) was determined for α-glucosidase (AG-ase),
BG-ase, P-ase, N-ase, Xy-ase and cellobiohydrolase (Cel-ase) (nmol MU g⁻¹ min⁻¹).
Significant responses (P < 0.05) are highlighted in bold and underlined. The only
significant annual response (a) is for microbial carbon (P = 0.02), although there is
also a marginal non-significant response for β-xylanase activity (P = 0.07). The
centre line of each box plot represents the median across the five plots over the
study period, the lower and upper hinges represent the first and third quartiles,
and whiskers represent ± 1.5 times the interquartile range.
Extended Data Table 1 | Soil properties

Depth                                  0-10 cm       10-30 cm      30-50 cm      50-100 cm

Total C (%)                            3.63 (0.33)   1.34 (0.07)   0.77 (0.04)   0.65 (0.05)
Total N (%)                            0.30 (0.02)   0.11 (0.01)   0.06 (0.00)   0.05 (0.00)
Total P (%)                            0.34 (0.01)   0.20 (0.01)   0.14 (0.01)   0.11 (0.01)
K2SO4 extractable - NH4 (mg N kg⁻¹)    1.87 (0.27)   1.58 (0.10)   1.17 (0.24)   1.00 (0.17)
K2SO4 extractable - NO3 (mg N kg⁻¹)    1.38 (0.27)   0.35 (0.05)   0.15 (0.02)   0.12 (0.04)
Resin P (mg P kg⁻¹)                    1.40 (0.18)   0.90 (0.13)   0.40 (0.09)   0.61 (0.16)
ECEC (cmolc kg⁻¹)                      48.8 (2.33)   49.1 (5.33)   42.90 (3.06)  42.4 (3.46)
Soil pH (H2O)                          6.02 (0.06)   6.04 (0.05)   6.00 (0.05)   6.02 (0.07)

Average values across soil depths for the study site (mean and standard error, n = 10). Resin P is phosphate extracted by anion-exchange membrane. ECEC, effective cation exchange capacity.
Values are pre-treatment and were determined in 2015 before the warming experiment was installed.


Extended Data Table 2 | The determinants of soil CO₂ efflux variation

1. No repeated measures

a) Annual soil CO₂ efflux        Parameter     SE     DF    P-value
Fixed effects
  Warming                        3.23          1.50   13    < 0.05 *
  Season                         -0.55         0.76   521   0.48
  Soil moisture                  -0.66         1.62   521   0.69
  Warming * soil moisture        -1.24         1.52   521   0.42
  Season * soil moisture         2.41          1.69   521   0.15
Random effect
  Space (plot number)            3.28          1.18   19    < 0.05 *
AIC value                        2823

b) Wet season soil CO₂ efflux    Parameter     SE     DF    P-value
Fixed effects
  Warming                        3.98          1.78   14    < 0.05 *
  Soil moisture                  2.43          1.34   2     0.07
  Warming * soil moisture        -2.06         1.90   424   0.28
Random effect
  Space (plot number)            2.23          1.27   15    0.10
AIC value                        2324

c) Dry season soil CO₂ efflux    Parameter     SE     DF    P-value
Fixed effects
  Warming                        [illegible]   0.50   99    < 0.05 *
  Soil moisture                  -2.05         1.06   99    0.055
Random effect
  Space (plot number)            4.67          0.05   99    < 0.01 **
AIC value                        469

2. With repeated measures

a) Annual soil CO₂ efflux        Parameter     SE     DF    P-value
Fixed effects
  Warming                        2.66          0.97   89    < 0.01 **
  Season                         1.34          1.48   52    0.37
  Soil moisture                  0.48          1.87   196   0.80
  Warming * soil moisture        -0.46         1.23   494   0.71
  Season * soil moisture         -0.08         2.10   169   0.97
Random effect
  Intercept (Space + Time)       2.71          1.40   69    0.06
AIC value                        2644

b) Wet season soil CO₂ efflux    Parameter     SE     DF    P-value
Fixed effects
  Warming                        3.09          1.12   104   < 0.01 **
  Soil moisture                  1.19          1.53   157   0.44
  Warming * soil moisture        -0.80         1.55   402   0.61
Random effect
  Intercept (Space + Time)       3.49          1.14   99    < 0.01 **
AIC value                        2181

c) Dry season soil CO₂ efflux    Parameter     SE     DF    P-value
Fixed effects
  Warming                        1.52          0.93   81    0.11
  Soil moisture                  -2.72         1.53   44    0.08
  Warming * soil moisture        -0.78         1.96   74    0.69
Random effect
  Intercept (Space + Time)       5.02          0.86   6     < 0.001 ***
AIC value                        465


The mixed-effect model analyses were performed with: (1) random effect of space (plot), that is, no repeated-measures effect; and (2) random effects of space (soil-collar location nested within
plot) and time, that is, including the repeated-measures effect. We performed the analyses first including all annual data (a) and then partitioned by wet season (b) and dry season (c). We tested
with and without a repeated-measures effect because within-plot soil CO₂ efflux measurements were partially spatially independent (within-plot locations of soil collars were changed every
three months, but not on every two-week measurement occasion within each three-month period; when moved, soil collars were relocated to randomly specified locations, but always
systematically assigned to zones). Mixed-effects models were fitted using maximum likelihood, by beginning with the full model (six variables: warming effect, soil moisture determined at
0-20 cm depth, season, interactions between soil moisture and warming and season, and random effects of space and time), followed by stepwise parameter removal. The final model was
determined by lowest Akaike information criterion (AIC) value. For each parameter, the standard error (SE) and degrees of freedom (DF) are shown; significant values are in bold and highlighted
by asterisks for P < 0.05 (*), P < 0.01 (**) and P < 0.001 (***).
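
The model-selection procedure described in this note (start from the full model, remove parameters stepwise, keep the model with the lowest AIC) can be sketched as follows. This is a minimal illustration and not the authors' code: the function names `aic` and `backward_select`, the callable `fit`, and the toy log-likelihood values are all assumptions made for the example. In practice `fit` would refit the mixed-effects model by maximum likelihood for each candidate set of terms, and a real procedure would also keep main effects whenever their interaction is retained.

```python
from typing import Callable, FrozenSet, Iterable, Tuple

# A fitter takes a set of fixed-effect terms and returns
# (maximised log-likelihood, number of estimated parameters).
Fitter = Callable[[FrozenSet[str]], Tuple[float, int]]


def aic(log_likelihood: float, n_params: int) -> float:
    """Akaike information criterion: 2k - 2 ln(L)."""
    return 2 * n_params - 2 * log_likelihood


def backward_select(full_terms: Iterable[str], fit: Fitter) -> Tuple[FrozenSet[str], float]:
    """Drop one term at a time for as long as doing so lowers the AIC."""
    current = frozenset(full_terms)
    best = aic(*fit(current))
    improved = True
    while improved and current:
        improved = False
        # Score every model that has exactly one term fewer.
        candidates = [(aic(*fit(current - {t})), current - {t}) for t in sorted(current)]
        cand_aic, cand_terms = min(candidates, key=lambda c: c[0])
        if cand_aic < best:
            best, current, improved = cand_aic, cand_terms, True
    return current, best


# Toy stand-in for a real model fit (hypothetical numbers): each term adds a
# fixed amount to the log-likelihood; two extra parameters cover the
# intercept and the residual variance.
TOY_GAIN = {"warming": 10.0, "soil_moisture": 0.3, "season": 0.2}


def toy_fit(terms: FrozenSet[str]) -> Tuple[float, int]:
    return -100.0 + sum(TOY_GAIN[t] for t in terms), len(terms) + 2


selected, selected_aic = backward_select(TOY_GAIN, toy_fit)
print(sorted(selected), selected_aic)
```

In the toy example only the term with a large likelihood gain survives, because each of the other terms costs more in the 2k penalty than it returns in fit.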
Extended Data Table 3 | Treatment effects on root and soil components of CO₂ efflux

Pre-treatment
  Root-derived CO₂
    Fixed effects
      Warming
    Random effect
      Space (plot) + Time
  Soil-derived CO₂
    Fixed effects
      Warming
    Random effect
      Space (plot) + Time
Post-treatment
  Root-derived CO₂
    Fixed effects
      Warming
    Random effect
      Space (plot) + Time
  Soil-derived CO₂
    Fixed effects
      Warming
    Random effect
      Space (plot) + Time

The flux components were derived from soil CO₂ efflux from root-ingrowth and root-exclusion cores. The mixed-effect model analyses were performed with random effects of space (plot)
and time; a repeated-measures design was used because the partition cores were fixed. The analyses were performed including pre-treatment (January to June 2016) and post-treatment data
following two years of warming (November 2016 to December 2018). For each parameter, the standard error (SE) and degrees of freedom (DF) are shown; significant values are in bold and
highlighted by asterisks for P < 0.05 (*), P < 0.01 (**) and P < 0.001 (***).
Extended Data Table 4 | Determinants of soil moisture variation

1) Surface

a) Annual soil moisture                Parameter      SE             DF           P-value
Fixed effects
  Warming                              [illegible]    0.04           84           0.19
  Season                               0.09           0.03           524          < 0.01 **
  Warming * Season                     0.02           0.04           524          0.64
Random effect
  Space (plot number)                  0.39           0.02           84           < 0.001 ***
AIC value                              -246

b) Wet season soil moisture            Parameter      SE             DF           P-value
Fixed effects
  Warming                              [illegible]    0.02           8            0.17
Random effect
  Space (plot number)                  0.48           0.02           8            < 0.001 ***
AIC value                              -268

c) Dry season soil moisture            Parameter      SE             DF           P-value
Fixed effects
  Warming                              -0.06          0.05           98           0.25
Random effect
  Space (plot number)                  0.39           0.03           98           < 0.001 ***
AIC value                              469

2) Whole-profile

a) Annual soil moisture                Parameter      SE             DF           P-value
Fixed effects
  Warming                              -7.0 × 10^-2   5.8 × 10^-2    28           0.24
  Season                               0.15           6.8 × 10^?     681          < 0.001 ***
  Warming * Season                     9.0 × 10^?     5.3 × 10^?     7.2 × 10^5   < 0.001 ***
Random effect
  Space (depth / plot) + Time (day)    0.42           4.2 × 10^-2    29           < 0.001 ***
AIC value                              -11 × 10^?

b) Wet season soil moisture            Parameter      SE             DF           P-value
Fixed effects
  Warming                              -0.05          0.07           28           0.44
Random effect
  Space (depth / plot) + Time (day)    0.57           0.05           28           < 0.001 ***
AIC value                              -7 × 10^?

c) Dry season soil moisture            Parameter      SE             DF           P-value
Fixed effects
  Warming                              -0.07          0.05           28           0.13
Random effect
  Space (depth / plot) + Time (day)    0.42           0.03           29           < 0.001 ***
AIC value                              -6 × 10^?


Data are for (1) surface soils, 0-20 cm soil, measured by soil moisture probe; and (2) whole profile, 0-20 cm, 50-70 cm and 100-120 cm, measured by fixed moisture sensors. For surface soils 
(1), the analyses were performed including all data (1a) and then partitioned by wet season (1b) and dry season (1c). Mixed-effects models were fitted using maximum likelihood, by beginning 
with the full model (four variables in 1a: warming effect, season, and interactions between soil moisture and warming and season, and a random plot effect; two variables in 1b and 1c: as above 
but without seasonal effects) and then using step-wise parameter removal. All measurements of within-plot soil moisture were spatially independent. For whole-profile data (2), the analyses 
were performed including all data (2a) and then partitioned by wet season (2b) and dry season (2c). Mixed-effects models were fitted using maximum likelihood, by beginning with the full
model (five variables in 2a: warming effect, season, and interactions between soil moisture and warming and season, and random effects of depth nested in plot and time; two variables in 2b 
and 2c: as above but without season effects) and then using step-wise parameter removal. Repeated-measures analyses were used because the soil-profile moisture data are from fixed 
sensors. The final model was determined by lowest Akaike information criterion (AIC) value. For each parameter, the standard error (SE) and degrees of freedom (DF) are shown; significant 
values are in bold and highlighted by asterisks for P < 0.05 (*), P< 0.01 (**) and P< 0.001 (***). 
Although habitat loss is the predominant factor leading to biodiversity loss in the
Anthropocene, exactly how this loss manifests, and at which scales, remains a
central debate. The ‘passive sampling’ hypothesis suggests that species are lost in
proportion to their abundance and distribution in the natural habitat, whereas the
‘ecosystem decay’ hypothesis suggests that ecological processes change in smaller
and more-isolated habitats such that more species are lost than would have been
expected simply through loss of habitat alone. Generalizable tests of these
hypotheses have been limited by heterogeneous sampling designs and a narrow focus
on estimates of species richness that are strongly dependent on scale. Here we analyse
123 studies of assemblage-level abundances of focal taxa taken from multiple habitat 
fragments of varying size to evaluate the influence of passive sampling and ecosystem 
decay on biodiversity loss. We found overall support for the ecosystem decay 
hypothesis. Across all studies, ecosystems and taxa, biodiversity estimates from 
smaller habitat fragments—when controlled for sampling effort—contain fewer 
individuals, fewer species and less-even communities than expected from a sample of
larger fragments. However, the diversity loss due to ecosystem decay in some studies 
(for example, those in which habitat loss took place more than 100 years ago) was less 
than expected from the overall pattern, as a result of compositional turnover by 
species that were not originally present in the intact habitats. We conclude that the
incorporation of non-passive effects of habitat loss on biodiversity change will
improve biodiversity scenarios under future land use, and planning for habitat 
protection and restoration. 


When habitat is lost, two different processes can lead to biodiversity 
decline. First, species are lost as habitat area is lost via the ubiquitous 
species–area relationship; fewer species can persist in smaller
habitats, and species that are rarer in the landscape will go extinct 
owing to passive sampling. Second, the demography of species within 
the remaining habitat can be altered, increasing extinction risk (or 
decreasing recolonization) subsequent to immediate species losses.
This process, which is referred to as ecosystem decay, supposes that
biological processes within intact habitats embedded within smaller 
habitat areas differ from those in larger patches (for example, owing 
to edge effects, lowered dispersal or demographic stochasticity). For 
a given sampling effort, we might expect fewer species, changes in 
total and relative abundances, and often altered ecosystem functions, 
in smaller relative to larger habitats as a result of ecosystem decay 
(see empirical examples in refs. *3). Both of these processes—biodi- 
versity loss owing to passive sampling or to altered demography that 
leads to ecosystem decay—are well-known, but often confounded in 
empirical tests for which sampling efforts are variable. Additionally, 
most models that forecast biodiversity loss as a result of habitat loss
implicitly assume passive sampling from larger to smaller habitats when
developing scenario projections (for example, refs. "”*”), ignoring 
demographic effects. At present we are unable to synthesize the rela- 
tive importance of the two main processes of biodiversity loss with 
habitat loss. Without this synthesis, we cannot address the necessity 
of incorporating loss owing to both passive sampling and ecosystem 
decay into forecasts of future biodiversity scenarios. 


Passive sampling versus ecosystem decay 


In Fig. 1, we visualize the primary processes by which biodiversity and 
its empirical estimates can respond to habitat loss when explicitly 
controlling for sampling effort to establish explicit testable hypotheses.
Although it is often impractical or impossible to count all
of the individuals or species in a given habitat patch, we can readily 
establish a scheme to estimate the numbers of individuals, the num- 
bers of species or other estimates of biodiversity (for example, relative 
abundances) within samples of the habitat (as illustrated by the square 
plots in Fig. 1). By standardizing this sampling effort, we can compare 
effort-standardized estimates from larger, more-continuous habitat
patches to those from smaller habitat patches. With this, we can then
explicitly test the importance of within-patch demographic changes
that lead to the ecosystem decay of biodiversity measures amidst losses
due to passive sampling. Additionally, even if sampling effort among
patches was not standardized in the original study, but is known, we
can use post hoc methods for standardization (for example, various
forms of rarefaction; Methods).

Fig. 1 | Conceptual illustration of the hypotheses and data structure.
a, Small fragments (left) showing three hypotheses for comparisons made with
the large fragment (right): (1) passive sampling, in which the densities of
individuals and the relative abundances of species are similar to that of the
large fragment; (2) ecosystem decay (evenness), in which the relative
abundances of species are more uneven in the small fragment compared to the
large fragment; and (3) ecosystem decay (individuals), in which the density of
individuals is lower in the small fragment compared to the large fragment.
Sampling is illustrated with boxes inside each fragment. In this example, the
sampling effort increases from small (two samples) to large (four samples)
fragments, but sampling effort can be standardized post hoc because
sample-level data are available; other sampling methods are slightly different
but can also be standardized (Methods). b, We compare the standardized
number of individuals (average individuals per sample) (left), standardized
species richness (average richness per sample) (centre) and standardized
species evenness (average evenness per sample) (right) between a small
fragment and a large fragment to illustrate how this information can be used to
disentangle the passive sampling versus the ecosystem decay hypotheses. For
the purposes of simple illustration, the large fragment has a completely even
species abundance distribution, even though this is unrealistic. This figure was
created for this paper by F. Arndt (www.Formenorm.de).

Several outcomes are possible when comparing sample-effort-controlled
biodiversity patterns from larger habitat fragments to those
from smaller habitat fragments: three illustrative cases of these outcomes
are shown in Fig. 1. The first is passive sampling: here, while there
may be fewer total species in smaller relative to larger fragments, the 
numbers of individuals, species and relative abundances of species with 
a given standardized sampling effort are the same. That is, we expect
there to be no relationship between any of these standardized estimates 
of biodiversity and habitat fragment size. In Extended Data Fig. 1, we 
use simulations to explore the robustness of this expectation and show 
that it is not sensitive to extreme non-randomness (that is, aggregation) 
in species distributions. Second, we consider ecosystem decay as a 
result of changes in the relative abundance of species. In this outcome, 
differential demographic responses of species in the assemblage lead 
to some species becoming relatively more abundant and others rela- 
tively less abundant (that is, relative abundances become more uneven) 
in smaller compared to larger fragments. As a result, the number of
species for a given sampling effort declines, as do other measures of
diversity based on relative abundances, with fragment size. The third
example is ecosystem decay that results from reductions in the numbers
of individuals. Here, there are fewer total species and also fewer
species per standardized sampling effort in smaller relative to larger
fragments, resulting from lower demographic rates of species in the
small compared to the large fragments. In this scenario, while there are
fewer individuals in samples from smaller fragments, all species have
the same demographic response such that the relative abundances of
species have not changed. As a result, the rarefied richness (standardized
for the numbers of individuals) is similar in smaller and larger fragments.
Although this is not shown in Fig. 1, it is also likely that ecosystem
decay can result from changes both to the total numbers of individuals
and to species relative abundances, from larger to smaller fragments.
These are not the only possible outcomes, but other patterns (such as
increases in abundance or evenness with decreasing habitat size) are
less likely. Such increases in abundance or evenness would in turn lead
to higher measures of diversity in standardized samples within smaller
patches. Some of these other outcomes are plausible, but we do not
discuss them further as the bulk of our evidence indicates they are far
less common than the three shown in Fig. 1.

Fig. 2 | Ecosystem decay drives patterns of biodiversity loss in habitat
fragments. a, Global map, indicating the taxon group and location of studies
(n = 123) included in our analyses. b–d, Standardized samples show that
number of individuals (b), species richness (c) and evenness (d) all increase as a
function of fragment size. Solid black lines and shading show overall
relationships and 95% credible intervals for each metric; the slope (β)
coefficient for each metric and its 95% credible interval are shown at the top.
Coloured lines show study-level relationships for different taxon groups (as
shown in the map legend).


Testing the hypotheses

Although the hypotheses illustrated in Fig. 1 are straightforward, appropriate
syntheses have not heretofore been possible because of highly
heterogeneous data that come from different sampling designs, as
well as often-confounded analyses that do not control for sampling
effort and grain. However, if ecosystem decay in the form of changes
to the numbers of species, individuals and/or relative species abundances
in standardized samples is pervasive, this would mean that the
majority of biodiversity loss scenarios, which assume passive sampling
and ignore ecosystem decay, probably underestimate the effects of
habitat loss on species loss. Here, we have developed an explicit and 
comprehensive evaluation of the passive sampling versus ecosystem 
decay hypotheses using 123 datasets in which species abundances were 
sampled from multiple habitat fragments of different sizes within
once-continuous landscapes (Fig. 2a). Our dataset was explicitly compiled
for this effort, and we developed an analytical pipeline that controls for
sampling effort to test the hypotheses of biodiversity loss stemming
from habitat loss. Datasets had from 2 to 53 habitat fragments of varying
size within a given ecosystem, ranging from less than 1 ha to more than
10,000 ha of continuous habitat cover, and included studies on plant, 
invertebrate and vertebrate assemblages. Approximately 85% of our 
studies came from forested landscapes embedded within anthropo- 
genic matrices (for example, agriculture), and the other studies came 
from shrublands, grasslands and wetlands. Although our search for 
datasets was global, there are clear geographical patterns in terms of 
where studies were collected; the majority come from tropical areas, 
especially Central and South America (Fig. 2a) (further details are pro- 
vided in the Methods and ref. 2”). 

Overall, we found strong support for the ecosystem decay hypothesis 
across all studies (Fig. 2, Extended Data Fig. 2). After controlling for sam- 
pling effort (Methods), we found that each of the biodiversity variables 
increased as the size of the habitat fragments increased. This included 
increases in the standardized number of individuals with fragment size
(Fig. 2b), increases in standardized species richness with fragment size
(Fig. 2c), increases in the evenness (relative abundance) of species in the
community with fragment size (estimated via the effective number of
species conversion of the probability of interspecific encounter (S_PIE), a
measure of evenness that is relatively insensitive to sample size
(Methods)) (Fig. 2d) and increases in several other sample-effort-controlled
biodiversity measures with fragment size (Extended Data Fig. 2). 
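
Two of the sample-effort-controlled measures named above can be sketched in a few lines. The code below is a hedged illustration and not the paper's analytical pipeline: it assumes the classic individual-based rarefaction formula for expected richness, and it assumes that S_PIE is computed as 1 / (1 - PIE) with the usual N / (N - 1) small-sample correction for PIE. The function names and the abundance vectors are hypothetical.

```python
from math import comb
from typing import Sequence


def rarefied_richness(abundances: Sequence[int], n: int) -> float:
    """Expected number of species in a random subsample of n individuals."""
    total = sum(abundances)
    if not 0 < n <= total:
        raise ValueError("n must be between 1 and the number of individuals")
    all_draws = comb(total, n)
    # For each species: 1 minus the probability that the subsample misses it.
    return sum(1 - comb(total - a, n) / all_draws for a in abundances if a > 0)


def s_pie(abundances: Sequence[int]) -> float:
    """Effective number of species derived from the probability of
    interspecific encounter (PIE), assuming S_PIE = 1 / (1 - PIE)."""
    total = sum(abundances)
    if total < 2:
        raise ValueError("at least two individuals are needed")
    pie = (total / (total - 1)) * (1 - sum((a / total) ** 2 for a in abundances))
    if pie >= 1:
        # Every individual belongs to a different species.
        return float("inf")
    return 1 / (1 - pie)


# Hypothetical counts per species from one sample in each fragment.
large_fragment = [5, 5, 5, 5]
small_fragment = [17, 1, 1, 1]

# Compare richness at a common number of individuals, and evenness directly.
common_n = 10
print(rarefied_richness(large_fragment, common_n), rarefied_richness(small_fragment, common_n))
print(s_pie(large_fragment), s_pie(small_fragment))
```

Both toy samples contain four species and twenty individuals, so raw richness and abundance are identical; the rarefied richness and S_PIE differ only because the relative abundances differ, which is the evenness form of ecosystem decay.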

We developed a second analysis to explicitly account for the uncer- 
tainty surrounding the expected biodiversity estimates from the passive 
sampling hypothesis. Simulations of the passive sampling hypothesis
show that there can be variation in the expected slopes of biodiversity
responses to the size of habitat fragments (Extended Data Fig. 1),
although our observed responses (Fig. 2, Extended Data Fig. 2) were
considerably greater than any departures in our simulations. Nevertheless,
to account for this variation, we calculated variation in the
‘expected’ value for a given smaller fragment by resampling from the
largest fragment. We then included uncertainty in the expected value
under passive sampling when making comparisons to the ‘observed’
value in each smaller fragment. The resulting z-score (Methods) allows
us to discern whether and by how much biodiversity in smaller fragments
deviates from that expected under passive sampling. With this analysis,
we found that for each sample-standardized diversity response, the
observed values deviated more from expected values (lower z-scores)
for smaller patch sizes, which indicates strong support for ecosystem
decay having a role in the response of taxa to habitat loss (Extended
Data Fig. 3). Moreover, to make all 123 datasets comparable, we made
decisions regarding response variables (for example, how to rarefy from
different types of data), as well as the driver variables (for example, how
to estimate the size of habitats classified as ‘continuous’). We present
several analyses (Methods) that examine the sensitivity of our results,
and show that our overall results are highly robust (Extended Data Fig. 4).

Fig. 3 | Study-level variation in the response of species richness to habitat
loss. a–d, Density plots of posterior distributions for study-level slope
estimates of standardized species richness grouped by taxon group (a),
continent (b), time since fragmentation (c) and matrix filter (d). The dashed
vertical line shows the zero slope expected from passive sampling. Each
density plot is based on 1,000 samples from the posterior distribution of each
study-level slope estimate. Densities are shaded by quantiles and the black
diamond shows the median for each group. Solid line and surrounding shading
show the overall slope and 95% credible interval. n values next to each plot refer
to the number of studies included in each category.


Variation in the effects

We were also interested in understanding how the response of biodiversity
to fragment size varied across studies owing to the taxa considered,
the continent on which the studies were located, the time
since fragmentation in the study, the quality of the matrix habitat sur- 
rounding the fragments in the study, and the latitude of the study. We 
focused on differences in the slopes of the response (with a slope of 
approximately zero expected from passive sampling) because differ- 
ences in intercepts are influenced by the species pool and sampling 
methodology, and thus not meaningful in this context. We illustrate 
variation around the overall trend for sample-effort-standardized spe- 
cies richness in Fig. 3, and for sample-effort-standardized number of 
individuals and evenness in Extended Data Fig. 5. 
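
The slope that this section compares across studies can be illustrated with a simple least-squares fit. This is only a sketch under stated assumptions: the study-level slopes in the paper are reported with posterior distributions and credible intervals, so they evidently come from a Bayesian model, whereas the function `loglog_slope`, the choice of a log-log scale and the example values below are assumptions made for illustration.

```python
from math import log
from typing import Sequence


def loglog_slope(fragment_sizes_ha: Sequence[float], values: Sequence[float]) -> float:
    """Ordinary least-squares slope of log(value) against log(fragment size)."""
    if len(fragment_sizes_ha) != len(values) or len(values) < 2:
        raise ValueError("need two equally long sequences with at least two points")
    xs = [log(s) for s in fragment_sizes_ha]
    ys = [log(v) for v in values]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("fragment sizes must not all be equal")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


# Hypothetical study with four fragments.
sizes = [1.0, 10.0, 100.0, 1000.0]

# Passive sampling: standardized richness does not depend on fragment size.
passive = [12.0, 12.0, 12.0, 12.0]

# Ecosystem decay: standardized richness rises with fragment size.
decay = [12.0 * s ** 0.1 for s in sizes]

print(loglog_slope(sizes, passive), loglog_slope(sizes, decay))
```

A slope near zero matches the passive sampling expectation, and a positive slope indicates that effort-standardized diversity is lower in smaller fragments. The intercept is ignored for the reason given in the text: it reflects the species pool and the sampling method.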

With a few exceptions, all results are qualitatively similar to those for
species richness, which showed, first, that there are no clear differences
across taxa (Fig. 3a). This is somewhat surprising, as it has often
been suggested that larger taxa or those that disperse poorly might 
respond to habitat loss more markedly than taxa with the opposite traits 
(for example, refs. 77). However, the broad taxonomic groupings that 
were necessary for this global analysis do not allow for comparisons 
of species traits that vary within a group (for example, wind- versus 
animal-dispersed plants). 

Second, we found little signal of continent, other than for temperate 
North America (which showed higher than average slopes) and Europe 
(which showed lower than average slopes) (Fig. 3b). One possibility is 
that these deviations reflect differences in other covariates. Specifically, 
habitats in 10 of the 16 studies from Europe were fragmented more 
than 100 years before the data were collected, whereas fragmenta- 
tion occurred much more recently elsewhere. Likewise, studies from 
North America all had intermediate-to-harsh environmental filters. We 
explore time since fragmentation and matrix quality in more detail in 
the following two paragraphs. 
Fig. 4 | Relationship between the species-richness response and the species
compositional turnover and nestedness. Studies with strong richness
responses have higher turnover between small fragments (that is, a negative
slope between turnover and fragment size) and the highest nestedness.
a, Study-level slope estimates of the relationship with fragment size for species
richness and the turnover component of Jaccard’s dissimilarity (Pearson’s
correlation coefficient ρ = −0.42 (−0.56, −0.27; 95% confidence interval)).
b, Study-level slope estimates of the relationship with fragment size for species
richness and the nestedness component of Jaccard’s dissimilarity (Pearson’s
correlation coefficient ρ = 0.48 (0.34, 0.61; 95% confidence interval)). Solid
lines show linear models fit with ordinary least squares to aid visualization of
relationships.

Third, studies with the longest time since habitat loss had the shallowest
slopes (Fig. 3c). This result might seem counterintuitive given
the ‘extinction debt’ concept, which refers to the fact that species may
persist in fragments after habitat loss, but eventually go extinct as
populations are no longer viable. However, alternative explanations
for shallower slopes in older habitat fragments could be that populations
eventually compensate for early losses (for example, via dispersal
into the fragment or recovery of the landscape matrix), or that species
turnover via immigration of more-tolerant species compensates for
losses of more-specialized species. We address the latter in the ‘Effects
on species composition’ section below.

Fourth, studies in which fragments had habitat matrices that were
categorized as harsh (for example, intense agriculture or urban areas)
had steeper slopes (Fig. 3d). Fragments within matrices that were
categorized as light (for example, secondary forests or shade-grown
low-intensity agriculture) had shallower slopes (Fig. 3d). Such a result
is consistent with expectations that the effects of ecosystem decay
should be more pronounced when matrices are distinct from the habitat
of interest, and supports the idea that less-intensive habitat use (and
restoration) can reduce the effect of habitat loss on biodiversity loss
within intact habitats.

Finally, we compared our results with those from a recent study that
used very different methodology and scales to show greater responses
to fragmentation in tropical compared to temperate areas. We found
a weak decline in the slope estimate with increasing (absolute) latitude
(−0.0005; 95% credible interval of −0.001 to 0.0001) (Extended
Data Fig. 6), and there was no indication of variation among taxa in
this result. Even though our study explicitly focused on local-scale
variation in biodiversity within remnant fragments of different sizes
(whereas that study examined landscape-level variation in biodiversity
across fragmented habitats), these very different approaches both
show stronger fragmentation effects in tropical areas.

A general feature of natural landscapes is that species are typically
distributed non-randomly, and habitats are often destroyed
non-randomly. As a result, it is possible that habitat fragments left
remaining in the landscape are qualitatively different from those that
were destroyed, and that diversity responses to habitat destruction
can emerge from geometric, as well as demographic, effects. For
example, if a habitat fragment that remains after habitat destruction
is of low quality, it may have less diversity even if there were no shifts
in species demography following habitat destruction. If this were true,
the patterns we observed would still be consistent with the ecosystem
decay hypothesis, but the mechanism would be slightly different: they 
would be due to spatial rather than temporal variation in local demog- 
raphy after habitat loss. With the data that we have available, we have 
no indication of the quality of habitat patches before or after the sur- 
rounding habitats were destroyed, and thus we cannot differentiate 
whether spatial or temporal demographic variation (or both) underpin 
the observed ecosystem decay. Nevertheless, we argue that scenarios 
in which lower-quality patches are left behind (for example, if habitat 
destruction favours the removal of habitats with better resources to 
extract) are equally as likely as those in which higher-quality patches 
remain (for example, if regulations and habitat protection favour leav- 
ing particular higher-quality habitats). If true, then our null expectation 
that sample-controlled diversity estimates should be equal on average 
in larger versus smaller fragments should remain, and that any changes 
to this would be a result of shifts in demographic responses of species
after habitat loss. 


Effects on species composition 


Our focus thus far has been on biodiversity change measured as num- 
bers and relative abundances of species, but habitat fragmentation 
can also alter the composition of species. For example, habitat frag- 
mentation often disadvantages habitat specialists but can benefit 
(or have a lower effect on) habitat generalists. We examined compositional
change and how it related to habitat loss by partitioning
total community dissimilarity between large and small fragment pairs 
into: (i) the turnover component, which indicates the degree to which 
compositional dissimilarity results from different species; and (ii) the 
nestedness component, which indicates how much of the dissimilar- 
ity between fragments results from species in smaller habitats being 
subsets of more speciose fragments. On the whole, turnover among 
fragments was higher than nestedness components (Extended Data 
Fig. 7), consistent with the idea that there was an overall shift from 
more-specialist to more-tolerant species in smaller habitat fragments. 
However, there were enough cases of high nestedness between small 
and large fragments that there was no overall trend in the contribution 
of turnover to compositional change between large and small fragments 
(Extended Data Fig. 7a, b). On the other hand, nestedness components 
between fragments increased as the difference in size of habitat frag- 
ments increased, which indicates that biodiversity loss from larger to 
smaller fragments contributes to nested compositional differences 
among those fragments (Extended Data Fig. 7c, d). 


Differences across studies in the species compositional turnover 
may also help to explain some of the variation in biodiversity responses 
to the size of habitat fragments. Studies with shallower slopes (for 
example, those with a long time since fragmentation or with less-harsh
matrices) had greater contributions of species turnover to dissimilarity 
(Fig. 4a), potentially buffering the richness response (for example, if 
more-tolerant species replaced those species that were lost). On the 
other hand, studies with steeper slopes (for example, with less time 
since fragmentation or harsher matrices) had greater contributions of 
nestedness (Fig. 4b), indicating that species losses contributed more 
to the dissimilarity. 


Discussion 


Our results strongly support the notion of ecosystem decay; that is, 
the biodiversity loss that results from habitat loss is greater than that 
expected if species losses result from passive sampling with no changes 
in local demography. However, we also found that species turnover 
slightly buffers this effect, at least in some cases. Because our results 
include many taxa in many systems, we cannot discern among the many 
different mechanisms (for example, edge effects, demographic sto- 
chasticity or reduction of dispersal), or combinations thereof, by which 
ecosystem decay occurs. It is also important to emphasize that our 
analysis is conducted strictly within habitat patches. Thus, we cannot 
fully resolve the current debate as to whether and how habitat fragmentation
per se (that is, the spatial configuration of habitat fragments
when controlling for total habitat amount) influences biodiversity at the 
scale of the landscape’ ®: the landscape-level data required to address 
this question are exceedingly rare. Even though our analyses point to 
a consistent decline in biodiversity in the habitats that remain after 
habitat loss and fragmentation, it is certainly possible that—at some 
scales—biodiversity might be unaffected or even increase at the land- 
scape scale after fragmentation, depending on the balance between 
species that favour intact habitats and those that favour matrix habi- 
tats®. Nevertheless, assuming a given habitat type continues to be lost 
disproportionately, and that a number of species tend to specialize
on the declining habitat type, then our results predict that the loss of 
biodiversity under an ecosystem decay scenario will extend beyond 
that expected from passive sampling. 

Our results point to a critical limitation when forecasting the biodiversity
loss expected due to habitat loss using the logic of the species–area
relationship or the related relationship between endemic
species and area (the endemics–area relationship)”’”"®. Any such modelling
implicitly assumes that the underlying shape of the species–area
or endemics–area relationships depends only on habitat geometry
and the distribution of species before the habitats are destroyed, and 
thus implicitly assumes that species loss follows the passive sampling 
hypothesis (even if species distributions and habitat losses are not 
random per se). Improvements to these forecasts allow that species 
may persist in the matrix after habitat loss”’, but still assume that the 
distribution and demography of species in the remaining habitats is the 
same regardless of the amount of surrounding habitat. Because we find 
support for the ecosystem decay hypothesis (showing that biodiversity 
loss from habitat loss exceeds that which would have been expected 
simply from passive sampling of lost habitats), forecasts that use tools 
based on the species–area or endemics–area relationships will underestimate
longer-term patterns of biodiversity loss. We can apply the
results of our global synthesis to quantify proportional species loss with 
habitat loss under the endemics–area relationship (passive sampling)
and ecosystem decay models, which has been attempted previously
with limited data and much speculation”. In Extended Data Fig. 8, we
show how one could begin to predict species loss with habitat loss
using a version of the endemics–area relationship parameterized with
data from our synthesis, and compare the expectations of biodiversity
loss to those that would emerge if these demographic changes were
not incorporated. This provides an important step towards making
more realistic forecasts of biodiversity loss, which will pave the way 
for the development of more accurate tools for conservation-based 
management. At the same time, our results suggest that restoration 
in habitat matrices may moderate the pervasive ecosystem decay that 
we observed in our synthetic analysis. 
Methods


No statistical methods were used to predetermine sample size. The 
experiments were not randomized and investigators were not blinded 
to allocation during experiments and outcome assessment. 


Literature search and data inclusion 

Full details for the search criteria and process for developing the 
dataset analysed have previously been published”. In brief, the data 
needed for our analyses require data on the numbers of individu- 
als of each species from each fragment of different sizes, as well as 
information on sampling design, numbers of samples (that is, plots, 
transects, traps, mist nets and so on) and sampling effort (that is, plot 
areas, transect length, trapping time, net lengths and so on) in each
fragment. Because such data are exceedingly difficult to find, we did 
not restrict ourselves to formally standardized literature searches, 
but rather attempted to find as many datasets as possible. We also 
consulted important review papers (for example, refs. #72797") for 
possible data sources, including forward citation searches. By extract- 
ing data from publications, appendices, and online repositories from 
a number of studies, and contacting authors directly for many more,
we collated data from 117 studies for our purpose®**", In addition, 
we used data from six studies in the PREDICTS database that met 
our inclusion criteria’*“’, giving us a total of 123 studies. Here, ‘study’ 
refers to sampling at a given time and place on a specific taxonomic
group. If two or more taxonomic groups sampled in different ways 
were presented in the same paper, they were treated as two separate 
studies (hence the discrepancy between the number of studies and 
references). 


Explanatory variables used for analysis 
We also recorded metadata for each dataset, several of which we exam- 
ined as covariates in the analysis. Specifically, we used absolute latitude 
and the following categories as covariates. 


Taxon groupings. This included plants, invertebrates (mostly insects), 
birds, mammals and a category that included reptiles and amphibians
(because they were often sampled together, and there were not enough 
studies of each for separate analyses). 


Continent (or other geographical grouping). We classified studies 
as occurring in Africa, Asia, Central America (we included Mexico here 
for biogeographical, rather than political, reasons), Europe, North 
America, Oceania or South America. 


Time since fragmentation. We classified studies as ‘recent’ (fragmentation
occurred less than 20 years before the study), ‘intermediate’
(fragmentation occurred 20–100 years before the study) and ‘long’
(fragmentation occurred more than 100 years before the study).


Matrix quality. We categorized studies on the basis of three qualitative 
categories of matrix quality: ‘light’, in which remnant habitat fragments 
were surrounded by relatively hospitable habitats (for example, sec- 
ondary forest or shade-grown coffee); ‘intermediate’, in which remnant 
habitats occurred within a matrix of lightly used habitats (for example, 
forests surrounded by early successional habitat or livestock grazing);
and ‘harsh’, in which remnant habitats occurred within a
largely inhospitable matrix (for example, intense agriculture or water
when reservoirs were built to create islands). 


Vegetation type. We categorized the vegetation type within the habitat 
fragments as forest (n = 105), shrubland (n = 14), grassland (n = 3) and
wetland (n = 1). As the majority of studies were from forested areas,
we do not present the study-level variation in slope estimates for this 
categorization (which did not show any meaningful variation). 


Standardization and estimation of biodiversity parameters 

We made no attempt to standardize sampling across studies because 
of the inherent limitations of doing so when sampling methodology 
varies. Instead, we calculated a series of biodiversity indices that allow 
standardized comparisons of biodiversity differences among frag- 
ments within studies, which was crucial for evaluating the hypotheses 
overviewed in Fig. 1. 

Sampling designs fell into one of three broad categories: (i) standardized
sampling with equal numbers of samples (often one, but sometimes
several) and thus equal sampling effort among habitat fragments
(n = 50 studies with this kind of data structure); (ii) unequal numbers
of standardized samples (typically more samples in larger fragments),
but where data were available for each sample such that average per-sample
metrics could be calculated for each habitat fragment (n = 26
studies with this kind of data structure); (iii) unequal sampling effort
among fragments, because the number of standardized samples varied
among fragments and the data were not available on a per-sample basis
(that is, they were pooled across all samples in a fragment), or because
the effort per sample varied among the fragments (for example, different
lengths of transects in different fragments) (n = 47 studies with
this kind of data structure).

For each habitat fragment, we calculated several biodiversity indices 
that reflect different aspects of the total and relative species abun- 
dances, as well as aspects of sample completeness”°”. When there 
were several samples in a given fragment, we took the average of the 
calculated indices to get a fragment-level per-sample estimate. 

The standardized number of individuals (N_std) for each habitat fragment
is typically the average number of individuals for a constant
sampling effort. For studies in sample design (i), N_std was the number
of individuals per sample in each fragment, or the average number
of individuals when there were multiple samples per fragment. For
studies in sample design (ii), N_std was the average number of individuals
per sample in each fragment. For studies in sample design (iii), we
calculated N_std in two steps. First, we calculated the relative sampling
effort (SE_rel) for each fragment by dividing the total sampling effort in
that fragment by the minimum sampling effort per fragment across all
fragments in the study. Second, we divided the observed number of individuals
across all samples in that fragment (N_obs) by the relative sampling effort
of that fragment (that is, N_std = N_obs/SE_rel, rounded to the nearest integer).
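
The effort standardization for sample design (iii) can be illustrated with a short Python sketch. This is not the authors' R code; the function name and the example numbers are hypothetical, and Python's built-in round() (which rounds exact halves to the nearest even integer) is assumed to be an acceptable stand-in for 'rounded to the nearest integer'.

```python
def standardized_individuals(n_obs, effort):
    """Return N_std for each fragment of one study (sample design (iii)).

    n_obs  : observed individuals per fragment, pooled across samples
    effort : total sampling effort per fragment, in any consistent unit
    """
    min_effort = min(effort)
    n_std = []
    for n, e in zip(n_obs, effort):
        se_rel = e / min_effort          # relative sampling effort (>= 1)
        n_std.append(round(n / se_rel))  # N_std = N_obs / SE_rel
    return n_std


# Hypothetical study: three fragments sampled with 2, 4 and 10 transects.
print(standardized_individuals([30, 52, 140], [2, 4, 10]))  # [30, 26, 28]
```

The fragment with the lowest effort keeps its observed count, and all other fragments are scaled down to that effort.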

For each fragment, we calculated the number of species expected
for a given constant sampling effort, which we call standardized species
richness (S_std). For studies in sample design (i) or (ii), S_std was the
average of the observed richness per sample in each fragment. For
studies in sample design (iii), an individual-based rarefaction curve was
created for each fragment (based on, for example, refs. “°°), and S_std
was calculated as the expected species richness for the standardized
number of individuals (N_std) in each fragment.
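
Individual-based rarefaction has a closed-form expectation, which the following Python sketch evaluates. It assumes integer abundances and sampling without replacement (the classical hypergeometric formulation); the function name is hypothetical and the mobr implementation may differ in detail.

```python
from math import comb


def rarefied_richness(abundances, n):
    """Expected number of species in a random subsample of n individuals
    drawn without replacement from a fragment's abundance vector."""
    total = sum(abundances)
    if not 0 <= n <= total:
        raise ValueError("n must lie between 0 and the total abundance")
    # Species i is absent from the subsample with probability
    # C(total - N_i, n) / C(total, n); sum the complements over species.
    return sum(
        1 - comb(total - k, n) / comb(total, n)
        for k in abundances
        if k > 0
    )


# Hypothetical fragment with three species, rarefied to N_std = 4 individuals.
print(rarefied_richness([5, 3, 2], 4))
```

Evaluating this function at n = N_std for each fragment gives a value that plays the role of S_std under sample design (iii).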

To measure standardized evenness, we quantified the effective
number of species using the conversion of Hurlbert’s™™ probability of
interspecific encounter (S_PIE) (PIE = 1 − Simpson’s diversity)”. Here, we
calculated S_PIE per sample for each fragment (sample design (i) and (ii)).
For studies in sample design (iii), we calculated S_PIE using the full data of
each fragment. Although PIE (and S_PIE) can be influenced by sample size
at very low sampling effort, it quickly stabilizes to provide a relatively
unbiased estimate of relative species abundances in a community’.
We hereafter refer to S_PIE as ‘standardized evenness’ for simplicity to
indicate that it is influenced by the relative abundances of species’,
although we recognize that the term evenness is variously used in the
literature. Furthermore, we compare results when only including studies
with sample designs (i) and (ii) (n = 76 studies) to those that also
include sample design (iii) (n = 47 studies) (see ‘Sensitivity analysis’).
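
As a rough illustration of the evenness measure, the Python sketch below computes PIE as one minus the sum of squared relative abundances and converts it to an effective number of species as 1/(1 − PIE). Both the conversion and the optional small-sample correction N/(N − 1) are assumptions made for this example; the function names are hypothetical, and the exact variant used by the mobr package is not stated here.

```python
def pie(abundances, unbiased=False):
    """Probability that two randomly drawn individuals differ in species."""
    n = sum(abundances)
    if n < 2:
        raise ValueError("need at least two individuals")
    value = 1 - sum((k / n) ** 2 for k in abundances)
    if unbiased:
        value *= n / (n - 1)  # small-sample correction (assumed optional)
    return value


def s_pie(abundances, unbiased=False):
    """Effective number of species, taken here as 1 / (1 - PIE)."""
    p = pie(abundances, unbiased)
    if p >= 1:
        raise ValueError("undefined when PIE equals 1")
    return 1 / (1 - p)


# Four equally abundant species give an effective number of four species.
print(s_pie([10, 10, 10, 10]))
```

A community dominated by a single species yields a value close to 1, whatever its observed richness.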

Rarefied richness (S_n), the number of species for a standardized number
of individuals’, was calculated according to a previously published
protocol’. The reference number of individuals for S_n was the
minimum of two times the smallest number of individuals per sample
and the maximum number of individuals per sample among all fragments
in the study.

Coverage-standardized richness (S_cov) standardizes
comparisons of species richness with respect to sample completeness‘.
We derived the reference coverage for comparisons among
fragments following previously published suggestions”°. Accordingly,
the reference coverage for S_cov was the minimum of the maximum coverage
per sample and the minimum of the sample-scale coverage with
two times the observed numbers of individuals per sample.

Asymptotic richness for each fragment was calculated using the
Chao1 estimator (S_chao).

S_PIE, S_n and S_chao were calculated using the R package mobr’*, and S_cov
was calculated using the R package iNEXT™”.
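
For readers unfamiliar with the Chao1 estimator, a minimal Python sketch follows. It uses the classic form based on singleton and doubleton counts, with the usual bias-corrected fallback when there are no doubletons; which exact variant the mobr package implements is an assumption left open here, and the function name is hypothetical.

```python
def chao1(abundances):
    """Classic Chao1 lower-bound estimate of asymptotic species richness."""
    counts = [k for k in abundances if k > 0]
    s_obs = len(counts)                     # observed richness
    f1 = sum(1 for k in counts if k == 1)   # species seen exactly once
    f2 = sum(1 for k in counts if k == 2)   # species seen exactly twice
    if f2 > 0:
        return s_obs + f1 * f1 / (2 * f2)
    return s_obs + f1 * (f1 - 1) / 2        # fallback when f2 = 0


# Hypothetical fragment: two singletons and one doubleton among four species.
print(chao1([1, 1, 2, 5]))  # 6.0
```

The estimator requires integer counts, which is one reason non-integer abundances need special handling.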

To account for uncertainty in our biodiversity indices that results 
from sampling of fragments, we resampled 200 abundance distri- 
butions by resampling the observed number of individuals with 
replacement from the observed species abundance distributions and 
calculated all 5 biodiversity indices from the 200 resamples. For each 
analysis, we used the mean of each biodiversity index across the 200 
resamples for each fragment in each study. 
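
The resampling step can be sketched in Python as follows. The sketch draws the observed number of individuals with replacement, with probabilities proportional to the observed abundances, and averages an arbitrary index over 200 resamples. Function names and the fixed seed are hypothetical choices for illustration, and integer abundances are assumed.

```python
import random


def resample_abundances(abundances, rng):
    """Draw sum(abundances) individuals with replacement from the observed
    species abundance distribution and return the new abundance vector."""
    n = sum(abundances)
    draws = rng.choices(range(len(abundances)), weights=abundances, k=n)
    resampled = [0] * len(abundances)
    for species in draws:
        resampled[species] += 1
    return resampled


def resampled_mean(abundances, index, n_resamples=200, seed=1):
    """Mean of a biodiversity index across resampled abundance vectors."""
    rng = random.Random(seed)
    values = [
        index(resample_abundances(abundances, rng))
        for _ in range(n_resamples)
    ]
    return sum(values) / n_resamples


def observed_richness(abundances):
    return sum(1 for k in abundances if k > 0)


print(resampled_mean([12, 5, 2, 1], observed_richness))
```

Rare species drop out of some resamples, so the mean resampled richness is at most the observed richness.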


Dissimilarity partitioning 

We calculated the pairwise dissimilarity in community composi- 
tion between pairs of fragments within a study’”’ by partitioning the 
incidence-based dissimilarity metric (Jaccard) into turnover and nest- 
edness components””. For each fragment pair, we also partitioned the 
abundance-based (Ruzicka) dissimilarity metric into balanced variation 
in species abundances (the abundances of some species decline and 
other species increase from one fragment to the other) and abundance 
gradient (the abundances of all species decline or increase equally 
from one fragment to the other) components. Fragment pair calcula- 
tions of the dissimilarity partitions were calculated for standardized 
sampling effort. For studies with the same number of samples in each 
fragment (study design (i)), we directly calculated the dissimilarity 
and its partitioning for all fragment pairs. When there were different 
numbers of samples in fragments and sample-level data (study design 
(ii)), we implemented a resampling approach for fragment pairs that 
differed in their number of samples. We randomly selected one sample 
from each fragment and calculated dissimilarity and its partitioning, 
and repeated this procedure 200 times to calculate the average values 
for the dissimilarity partitions across the resamples for each fragment 
pair. For fragment pairs in study design (iii), we used our calculations of
the relative sampling effort (SE_rel) as well as the standardized number of
individuals (N_std) for each habitat fragment. Following the calculation
of standardized species richness (see ‘Standardization and estimation
of biodiversity parameters’), we resampled N_std individuals from the
abundance distributions of each fragment in the pair and calculated
the dissimilarity partitions from the resampled data. This procedure
was repeated 200 times to calculate the average values for the dissimilarity
partitions across the resamples for each fragment pair. We used
the beta.div.comp() function from the R package adespatial
for dissimilarity partitioning calculations.
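
To make the incidence-based partition concrete, the Python sketch below splits Jaccard dissimilarity for one fragment pair into turnover and nestedness components. It assumes the Baselga-family formulas, which are one of the options offered by beta.div.comp(); the text does not state which family was used, so this choice, like the function name, is an assumption.

```python
def jaccard_partition(species_1, species_2):
    """Return (total, turnover, nestedness) Jaccard dissimilarity for two
    fragments given as collections of species names."""
    s1, s2 = set(species_1), set(species_2)
    a = len(s1 & s2)  # species shared by both fragments
    b = len(s1 - s2)  # species found only in fragment 1
    c = len(s2 - s1)  # species found only in fragment 2
    if a + b + c == 0:
        raise ValueError("both fragments are empty")
    total = (b + c) / (a + b + c)
    m = min(b, c)
    turnover = 2 * m / (a + 2 * m) if a + 2 * m > 0 else 0.0
    nestedness = total - turnover
    return total, turnover, nestedness


# A small fragment holding a strict subset of the large fragment's species:
print(jaccard_partition({"A", "B", "C", "D"}, {"A", "B"}))  # (0.5, 0.0, 0.5)
# Two fragments with no species in common:
print(jaccard_partition({"A", "B"}, {"C", "D"}))            # (1.0, 1.0, 0.0)
```

In the first example all dissimilarity is nestedness (species loss); in the second it is entirely turnover (species replacement).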


Establishment of the null expectation 

Although Fig. 1 illustrates the expectation for passive sampling and 
the alternative hypotheses when habitat size varies, the uneven 
distribution of species across the landscape can influence habitat 
fragmentation effects”’. This might lead some to wonder whether 
the same types of spatial structuring might also influence patterns 
expected at the within-patch level where sampling effort was held 
constant. We used a spatial simulation model to show that our null 
expectation under passive sampling (that is, a zero slope) was robust 
across different types of spatial aggregation pattern (Extended Data 
Fig. 1). The regional species pool was set to have 40,000 individuals,
2,000 species and a log-normal species abundance distribution with
a coefficient of variation equal to 1. We examined three different levels
of within-species aggregation via a Thomas cluster process; each
species had a single cluster, and the degree of intraspecific aggregation
was controlled by setting the displacement of individuals of
each species from the single cluster (or mother point) to values that
resulted in either an approximately random distribution (sigma = 1),
or increasingly aggregated spatial distributions (sigma = 0.1, 0.02).
To avoid edge effects, fragments were randomly placed (without
overlap) within the central square unit of the landscape (fragment
area = {0.0005, 0.005, 0.05, 0.5}), and samples were taken from a
single quadrat (area = 0.0001) randomly placed within each fragment.
We quantified total numbers of individuals, species richness and
evenness (S_PIE) for each sample, and fit ordinary least-squares linear
models to log-transformed diversity (or total numbers of individu- 
als) as a function of log-transformed fragment size. This process was 
repeated 2,000 times, and we plotted the slope coefficients from the 
linear models to illustrate the expectation of the passive sampling 
hypothesis (Extended Data Fig. 1). All simulations were run using the 
R package mobsim’”. 


Statistical analyses 

Although many ecological syntheses use meta-analytic tools to 
compare published estimates’, our approach using underlying 
assemblage-level data enabled us to directly analyse data across studies'’**.
To examine the relationship between biodiversity indices and
fragment size, we fit two sets of hierarchical generalized linear models 
to the data: one to model each of the biodiversity indices described 
in ‘Standardization and estimation of biodiversity parameters’ as a 
function of fragment size; the second to explicitly account for uncer- 
tainty in the expected levels of biodiversity under the passive sampling 
hypothesis (see ‘Estimating and analysing uncertainty’). As the results 
were qualitatively consistent between the two sets of models, we pre- 
sent the relationships between biodiversity indices and fragment size 
in the main text, and show the robustness of our results to the inclu- 
sion of uncertainty surrounding the passive sampling hypothesis in 
Extended Data Fig. 3. 

In several studies, large habitat fragments were labelled as continu- 
ous, without provision of the actual size of these areas. For these cases, 
we imputed the size of the continuous habitat to be ten times as large as 
the next largest habitat fragment size reported in the respective study.
However, we also used different multiplicative factors to assess the 
robustness of our results to this assumption (see ‘Sensitivity analysis’). 

Fragment size was log-transformed and centred by subtracting the 
mean (log) fragment size of each study from the size of each fragment 
before fitting models, and back-transformed for presentation in the 
figures. For each biodiversity measure, we estimated the overall rela- 
tionship with fragment size, and also allowed the slope and intercept of 
the relationship to vary for each study in the dataset (that is, the slope 
and intercept were estimated as fixed effects, as well as random effects 
for each study). Following standardization, none of our response vari- 
ables remained with exclusively integer values, which precluded the 
use of Poisson or the negative binomial error distributions. Because 
all responses were constrained to be positive, we used the log-normal 
error distribution and an identity link function in the analysis of bio- 
diversity indices. 
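
The two preprocessing steps for fragment size (imputing a size for 'continuous' habitat and centring log-transformed sizes within each study) can be sketched in Python as below. The use of None to mark a continuous fragment, the function name and the example areas are hypothetical conventions for this illustration only.

```python
from math import log


def centred_log_sizes(sizes, multiplier=10):
    """Log-transform and centre the fragment sizes of one study.

    sizes      : fragment areas; None marks a fragment labelled 'continuous'
                 with no reported area
    multiplier : factor applied to the largest reported size to impute the
                 size of continuous habitat (10 in the main analysis)
    """
    known = [s for s in sizes if s is not None]
    imputed = multiplier * max(known)
    filled = [imputed if s is None else s for s in sizes]
    logs = [log(s) for s in filled]
    mean_log = sum(logs) / len(logs)   # study-level mean of log size
    return [x - mean_log for x in logs]


# Hypothetical study: fragments of 1 ha and 10 ha plus continuous forest.
print(centred_log_sizes([1, 10, None]))
```

Changing multiplier to 2 or 100 gives the alternative imputations examined in the sensitivity analysis.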

We visualized variation in effects of fragment area on biodiversity 
associated with taxon group, geographical region, time since fragmen- 
tation and matrix quality using the posterior distributions of study-level 
slope estimates. Alternatively, we could have fit models with interac- 
tions between the covariates and fragment area, but—owing to data 
limitations—these would have been restricted to two-way interactions. 
However, it is likely that multiple covariates combine and contribute to 
variation in the effects of fragmentation on biodiversity. Because each 
study represents a combination of all these covariates in our analyses, 
examining study-level variation allows us to efficiently and simultaneously
adjust for and characterize relationships among multiple covariates and 
fragmentation effects on biodiversity. However, to assess the robust- 
ness of patterns found in the study-level variation for the different 
groups, we fit models with two-way interactions between fragment 
size and taxon group, geographic region, time since fragmentation 
and matrix quality, respectively (Extended Data Tables 1-3). 


Estimating and analysing uncertainty 

The passive sampling hypothesis posits that biodiversity in an 
effort-standardized sample in a small fragment is the same as the 
biodiversity in a corresponding effort-standardized sample in a large
fragment (Fig. 1). That is, the expected value of the standardized biodi- 
versity metrics under the passive sampling hypothesis in smaller frag- 
ments is equal to that in the largest fragment for each study (Fig. 1). To 
account for uncertainty associated with the passive sampling hypothesis,
we calculated z-scores ((observed − expected)/s.d.(expected)) for
each biodiversity index and for each fragment smaller than the largest 
in each study. To derive the z-score, we calculated the expected value 
and its s.d. for each biodiversity index from the resampling process 
for the largest fragment in each study. We used the observed value for 
each of the biodiversity indices for each smaller fragment to calculate 
the z-scores. As the standardized number of individuals was defined
as N_std = N_obs/SE_rel, there was no estimate of uncertainty, and z-scores
were calculated only for the five biodiversity indices.
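
A minimal Python sketch of the z-score calculation is given below. The expected value and its standard deviation come from the resampled index values of the largest fragment in a study; the use of the sample (n − 1) standard deviation and the function name are assumptions made for illustration.

```python
from statistics import mean, stdev


def z_score(observed, resampled_largest):
    """z-score of an index observed in a smaller fragment, relative to the
    resampling distribution of the same index in the largest fragment."""
    expected = mean(resampled_largest)
    sd = stdev(resampled_largest)  # sample standard deviation (assumed)
    if sd == 0:
        raise ValueError("no variation among resamples; z-score undefined")
    return (observed - expected) / sd


# Hypothetical values: the smaller fragment holds fewer species than expected.
print(z_score(8, [10, 12, 14]))  # -2.0
```

Negative z-scores in smaller fragments, becoming less negative as fragment size increases, correspond to the positive slopes reported for the z-score models.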

We modelled the z-scores as a function of fragment size using models
similar to those described in ‘Statistical analyses’. Fragment size was 
log-transformed and centred by subtracting the mean (log) fragment 
size from each observation before fitting models. For the z-score of 
each measure, we estimated the overall relationship with fragment 
size, and also allowed the slope and intercept of the relationship to 
vary for each study in the dataset (that is, the slope and intercept were 
estimated as fixed effects, as well as random effects for each study). 
z-scores were distributed around zero for all metrics, and comparisons
of models fit with Gaussian, asymmetric Laplacian (back-to-back
exponential) or Student’s t error distributions showed strong support
for assuming error to follow a Student’s t distribution, which was used
for all analyses. Similar to the models of biodiversity indices, we found
positive slopes (95% credible intervals did not overlap zero) for the rela- 
tionship between z-scores of all metrics and fragment size (Extended 
Data Fig. 3). This suggests that our inference of pervasive ecosystem 
decay from Fig. 2 is robust to the inclusion of uncertainty in the pas- 
sive sampling hypothesis. 


Latitudinal variation 

To examine whether there was a latitudinal gradient in the slope of 
the relationship between fragment size and biodiversity indices, we 
modelled the study-level slope estimates as a function of absolute latitude.
We used a random-effects meta-analytic model that incorporated
uncertainty associated with the study-level slope estimates (that is, the 
standard error of the posterior distributions) into our analysis; study 
was included as a random effect, and the model assumed a Gaussian 
error distribution and an identity link function. To examine whether the 
latitudinal gradients differed between taxa, we fit a second model that
included an interaction between absolute latitude and taxon group. 


Compositional dissimilarity 

To examine the relationship between dissimilarity components among 
fragments and differences in fragment size, we modelled the turnover 
and nestedness components of Jaccard dissimilarity’’, and the bal- 
anced abundance and abundance gradient components of Ruzicka 
dissimilarity’, as a function of the log-transformed ratio of the sizes for 
each fragment pair (which was centred before model fitting). We fit models
that estimated the overall relationship, and also allowed both the
slope and intercept to vary for each study. Models assumed zero-one-inflated
beta distributions and logit-link functions for the turnover
components, and zero-inflated beta distributions (and logit-link functions)
for the nestedness components.


Model fitting 

For Bayesian inference and estimates of uncertainty, all models were fit 
using the Hamiltonian Monte Carlo (HMC) sampler Stan’ and coded 
using the brms package’. All models were fit with 4 chains and 2,000 
iterations, with 1,000 used as a warm-up. We used weakly regulariz- 
ing priors and visual inspection of the HMC chains showed excellent 
convergence. 


Sensitivity analysis 

Forty-seven studies had unequal numbers of samples across fragments
and data were pooled (that is, sample design (iii)). This pooling of samples
means that data from larger fragments, which encompass a larger
sampling extent, will also encompass more heterogeneity. This can,
in turn, influence diversity estimates. To examine whether our inferences
of ecosystem decay were sensitive to the inclusion of studies
with sample design (iii), we removed those studies and refit the models
to the remaining data (n = 76 studies for which sampling extents were
comparable).

For all analyses, we needed to assign a fragment size to fragments 
labelled as continuous without a specified size. In the main results, we 
calculated their size as ten times as large as the next largest fragment
in a given study, which was an arbitrary but reasonable assumption
(that is, no habitat is ever continuous). However, to assess the sensitivity 
of our results, we varied this multiplier so that continuous fragments 
were either 2 or 100 times larger than the next largest fragment. 

The calculations of the biodiversity parameters used here typi- 
cally require counts of individuals of each species (that is, integer 
abundance values). However, a few of the studies in our dataset report 
non-integer abundance values for samples (for example, based on 
averages for subsamples within each sample, where the subsample 
data were not available). In the results discussed in the main text, we 
calculated the biodiversity parameters directly from these non-integer 
abundances. To assess the potential sensitivity of our findings to the 
use of non-integer abundances, we implemented two additional methods.
In the first, we rounded all abundances up to the next integer,
which favours rare species with respect to their relative
abundance. The second method divided all abundances by the minimum
abundance in the respective study. This resulted in the lowest
abundance equalling 1 after scaling, but all relative abundances were 
maintained. 
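
The two alternative treatments of non-integer abundances can be written compactly, as in the Python sketch below. Function names and example values are hypothetical, and the minimum is taken over positive abundances only, which is an assumption (dividing by a zero abundance would be undefined).

```python
from math import ceil


def round_up(abundances):
    """Method 1: round every abundance up to the next integer."""
    return [ceil(x) for x in abundances]


def scale_by_minimum(study_samples):
    """Method 2: divide all abundances in a study by the smallest positive
    abundance, so the rarest record equals 1 and ratios are preserved.

    study_samples : list of per-sample abundance lists for one study
    """
    positive = [x for sample in study_samples for x in sample if x > 0]
    smallest = min(positive)
    return [[x / smallest for x in sample] for sample in study_samples]


print(round_up([0.5, 1.0, 2.25]))                    # [1, 1, 3]
print(scale_by_minimum([[0.5, 2.0], [1.5, 0.0]]))    # [[1.0, 4.0], [3.0, 0.0]]
```

Rounding up inflates the relative abundance of the rarest species (0.5 becomes 1), whereas scaling by the minimum leaves all relative abundances unchanged.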


Adding ecosystem decay to extinction forecasts 
Many published models of biodiversity loss implicitly or explicitly 
assume that scaling relationships of biodiversity are constant even 
when habitat is lost (that is, assume passive sampling). For example, 
forecasts based on the species–area relationship or the endemics–area
relationship implicitly assume that passive sampling is the primary 
process by which habitat size influences species diversity (or, rather, 
only consider immediate extinctions, and not extinction debt). Here 
we illustrate how estimates such as ours—which quantify the degree 
to which ecosystem decay leads to extinctions beyond passive sam- 
pling—could improve forecasts of biodiversity loss due to habitat loss. 
If smaller fragments are simply passive samples from the assemblages
found in larger areas, then the expected number of species lost with a
loss of area a (from a total area, A) can be calculated using the endemics–area
relationship”:


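$$S_{A-a} = S - \sum_{i=1}^{S} \left(\frac{a}{A}\right)^{N_i}$$

with species lost,

$$S_{\mathrm{passive}} = S - S_{A-a} = \sum_{i=1}^{S} \left(\frac{a}{A}\right)^{N_i}$$

Here, S_{A−a} is the expected number of species remaining in the area A − a,
N_i is the total abundance of species i, and S is the total number of species
in the region A.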

We calculate the additional species lost under ecosystem decay using
the slope estimated from our model fit to standardized species richness
(β = 0.06). We assume that species lost owing to passive sampling equals
S_passive and that additional species lost owing to ecosystem decay are
proportional to the area lost a, to calculate S_decay = S_passive + βa.
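
The two loss estimates can be compared numerically with a few lines of Python. The sketch takes S_passive as the sum over species of (a/A) raised to the power N_i, and S_decay = S_passive + βa with β = 0.06, as described in the text; the function name and the toy abundances are hypothetical.

```python
def species_lost(abundances, a, total_area, beta=0.06):
    """Return (S_passive, S_decay) for a loss of area a out of total_area.

    abundances : total abundance N_i of each species in the region
    beta       : slope for standardized species richness (0.06 in the text)
    """
    frac = a / total_area
    # A species is lost under passive sampling only if every one of its
    # N_i individuals falls inside the destroyed area.
    s_passive = sum(frac ** n for n in abundances)
    s_decay = s_passive + beta * a
    return s_passive, s_decay


# Toy region: three species with 1, 2 and 3 individuals; half the area is lost.
print(species_lost([1, 2, 3], a=5, total_area=10))
```

When the whole region is lost (a = A), S_passive equals the total number of species, so S_decay as defined here exceeds the species pool by βA; the comparison is therefore most informative for partial habitat loss.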

To illustrate the difference between species loss owing to random
expectations and the empirically derived loss including ecosystem
decay, we simulated a regional assemblage using the mobsim R package’.
We calculated S_passive and S_decay using the equations described
above. We set A = 10,000, S = 1,000 and simulated species in A using
a log-normal species abundance distribution (total community
N = 10,000, coefficient of variation = 1) randomly distributed in space.
We examined area lost, a, that ranged from 1 to A, and show the number
of extinctions as a function of the area (a) lost (Extended Data Fig. 8).


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


All of the data used in this analysis are open access and available in 
ref. ” (117 of the datasets) and ref. '*? (5 of the datasets). Raw data 
(before standardization) are available from GitHub
(https://github.com/FelixMay/FragFrame_1), and are mirrored on Zenodo
(https://doi.org/10.5281/zenodo.3862409).


Code availability 


The R code used for standardizing the data and doing the analyses
presented here is available from GitHub
(https://github.com/FelixMay/FragFrame_1), and is mirrored on Zenodo
(https://doi.org/10.5281/zenodo.3862409).
Extended Data Fig. 1 | Simulations of the null expectation under the
random sampling hypothesis for varying degrees of within-species
aggregation. To evaluate the robustness of the null expectation of a zero slope
of biodiversity patterns in standardized samples with fragment size, we took
different-sized samples from a simulated landscape and estimated biodiversity
patterns. a, Examples of landscapes with different levels of intraspecific
aggregation (left to right is from completely random to most-aggregated).
From within each of these landscapes, we illustrate four different fragment
sizes (shaded squares), and then take standardized (constant-sized) samples
from each fragment. b, Density plots showing slope estimates of linear models
fit to numbers of individuals (top), species richness (centre) and evenness
(bottom) as a function of fragment areas for 2,000 simulated landscapes of
each level of aggregation. Both the response and fragment area were
log-transformed before model fitting. Densities are shaded by quantiles and
the black diamond shows the median for each combination of aggregation and
metric; vertical dashed line shows the zero expectation and the median result is
given. In some cases, the median result lies very slightly above or below zero
(though this does not seem to be associated with levels of aggregation). This is
an outcome of the stochastic simulation we performed, and is sensitive to
parameters and numbers of iterations, and thus we do not perform statistical
tests.
Extended Data Fig. 2 | Different measures of species richness related to the
size of habitat fragments. For each case, richness metrics were positively
associated with the size of habitat fragments, supporting ecosystem decay as
the predominant driver. a, Richness standardized to a common number of
individuals. b, Richness standardized to a common sample completeness.
c, Asymptotic richness. Solid black lines and shading show overall relationships
and 95% credible intervals for each metric. The slope (β) coefficient and its 95%
credible interval are shown at the top. Coloured lines show study-level
relationships for taxon groups.
Extended Data Fig. 3 | Incorporation of uncertainty by calculating z-scores
of observed versus null-expected outcomes. Because there is always
uncertainty surrounding the expected outcomes based on passive sampling,
we repeated analyses of all metrics recalculated as z-scores. After
standardization, z-scores ((observed − expected)/s.d.(expected)) were
calculated for the following. a, Standardized species richness. b, Standardized
evenness (S_PIE). c, Richness standardized to a common number of individuals.
d, Richness standardized to a common sample completeness. e, Asymptotic
richness. As with the direct measurements, analyses of z-scores show, in total,
greater biodiversity loss than expected from passive sampling in smaller
habitat fragments, and thus support the ecosystem decay hypothesis. Solid
black lines and shading show overall relationships and 95% credible intervals
for each metric. Inset shows β-slope coefficient and its 95% credible interval.
Coloured lines show study-level relationships for taxon groups. The
β-coefficients are not directly comparable to the results from Fig. 2 and
Extended Data Fig. 2, owing to differences in the response scale of the z-scores.
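
As a small worked illustration of the z-score defined in this legend, the sketch below standardizes one observed value against a set of null-expected values. It assumes the expectation and its s.d. are taken as the mean and sample standard deviation of repeated passive-sampling draws; the function name and the example numbers are hypothetical.

```python
import statistics


def z_score(observed, null_values):
    """(observed - expected) / s.d.(expected), with the expectation and its
    s.d. estimated from a list of null (passive-sampling) values."""
    expected = statistics.mean(null_values)
    sd = statistics.stdev(null_values)
    if sd == 0:
        raise ValueError("null values have zero variance; z-score undefined")
    return (observed - expected) / sd


# Hypothetical example: observed richness of 12 against three null draws.
print(z_score(12, [8, 10, 12]))
```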


a. Fragment size slope estimated without studies with pooled samples

Metric                                                         Slope    Q2.5     Q97.5
Standardised species richness (S_std)                          0.044    0.024    0.066
Standardised number of individuals (N_std)                     0.034    0.003    0.065
Standardised evenness (S_PIE)                                  0.036    0.012    0.061
Richness standardised to a common number of individuals (S_n)  0.049    0.029    0.069
Coverage standardised richness (S_cov)                         0.043    0.018    0.067
Asymptotic richness (S_chao)                                   0.043    0.021    0.065

[Panels b-d: slope estimates under the reference and alternative imputation
scenarios. b, Standardised number of individuals. c, Standardised species
richness. d, Standardised evenness.]

Extended Data Fig. 4| Testing of robustness of results to alternative 
methods. a, Testing the sensitivity of our results to the exclusion of 47 studies 
in which data were pooled, and thus sample extent could not be controlled 
(Methods). Analyses on this subset of data (n= 79) were consistent with those 
from the full dataset (that is, the 95% credible intervals did not overlap zero). 
b-d, Testing the sensitivity of the results to decisions we made when imputing 
missing sizes of habitats labelled as continuous and the treatment of 
non-integer species abundances (Methods). In each case, the reference 
scenario imputed the size of continuous habitats to have 10x the area of the 
next largest fragment and calculated the biodiversity metrics using the 
non-integer abundance values. Alternate combinations were: (1) imputed area 
for continuous habitats assumed to be 2x that of next biggest fragment, and 
non-integer values unchanged; (2) imputed area for continuous habitats 
assumed to be 100x that of the next biggest fragment, and non-integer values
unchanged; (3) imputed area for continuous habitats assumed to be 10x that of
the next biggest fragment, and non-integer values rounded up; (4) imputed 
area for continuous habitats assumed to be 10x that of the next biggest 
fragment and all abundance values divided by the lowest value within each 
study, resulting in the lowest abundance equalling one, but retaining the same 
relative abundances. b, Slope estimates for the relationship between 
standardized numbers of individuals and fragment size. c, Slope estimates for 
the relationship between standardized species richness and fragment size. 

d, Slope estimates for the relationship between standardized evenness and
fragment size. Colours depict different auxiliary decisions and imputation
required in the analysis (Methods). Small points represent study-level
estimates, and large points and error bars are the overall estimates and their 
95% credible intervals. 
Extended Data Fig. 5 | Study-level variation in the number of individuals
and evenness. a-h, Density plots of posterior distributions of study-level
line and surrounding shading show the overall slope estimate and its 95%
Extended Data Fig. 7 | Relationship between size of habitat fragment and
species composition. Overall, we found that turnover contributes more than 
nestedness to pairwise dissimilarity between fragments within a study, but 
shows contrasting patterns with increasing fragment size differences. 

a, Turnover component of Jaccard dissimilarity. b, Turnover or balanced 
abundance component of Ruzicka dissimilarity. c, Nestedness component of 
Extended Data Table 1 | Models of standardized species richness (S_std)


a. Model: taxon × fragment size interaction
Parameter                                               Estimate  Est.Error  Q2.5     Q97.5
Intercept                                                1.546     0.231      1.091    1.993
Fragment size (log scale)                                0.070     0.025      0.022    0.118
Taxa (birds)                                             1.374     0.319      0.726    1.983
Taxa (invertebrates)                                     1.159     0.263      0.648    1.666
Taxa (mammals)                                           0.152     0.298     -0.428    0.721
Taxa (plants)                                            1.504     0.320      0.874    2.112
Fragment size:taxa (birds)                              -0.015     0.033     -0.077    0.052
Fragment size:taxa (invertebrates)                      -0.023     0.027     -0.076    0.029
Fragment size:taxa (mammals)                             0.008     0.031     -0.055    0.070


b. Model: region × fragment size interaction
Parameter                                               Estimate  Est.Error  Q2.5     Q97.5
Intercept                                                2.753     0.283      2.201    3.308
Fragment size (log scale)                                0.063     0.026      0.014    0.114
Region (Asia)                                           -0.182     0.367     -0.933    0.513
Region (Central America)                                -0.075     0.405     -0.861    0.732
Region (Europe)                                         -0.205     0.386     -0.947    0.552
Region (North America)                                  -1.151     0.523     -2.183    0.099
Region (Oceania)                                        -0.202     0.471     -1.139    0.696
Region (South America)                                  -0.427     0.320     -1.057    0.196
Fragment size:Region (Asia)                              0.006     0.031     -0.055    0.069
Fragment size:Region (Central America)                  -0.009     0.036     -0.079    0.060
Fragment size:Region (Europe)                           -0.056     0.032     -0.118    0.007
Fragment size:Region (North America)                     0.042     0.046     -0.048    0.133
Fragment size:Region (Oceania)                           0.020     0.041     -0.062    0.098
Fragment size:Region (South America)                    -0.007     0.029     -0.064    0.049


c. Model: time since fragmentation × fragment size interaction
Parameter                                               Estimate  Est.Error  Q2.5     Q97.5
Intercept                                                2.352     0.244      1.878    2.840
Fragment size (log scale)                                0.081     0.023      0.036    0.125
Time since fragmentation (20-100 years)                  0.176     0.269     -0.355    0.706
Time since fragmentation (> 100 years)                   0.018     0.311     -0.625    0.618
Fragment size:time since fragmentation (20-100 years)   -0.015     0.025     -0.064    0.034
Fragment size:time since fragmentation (> 100 years)    -0.065     0.028     -0.121    0.012


d. Model: matrix quality × fragment size interaction
Parameter                                               Estimate  Est.Error  Q2.5     Q97.5
Intercept                                                2.731     0.295      2.159    3.304
Fragment size (log scale)                                0.000     0.028     -0.055    0.055
Matrix (intermediate)                                   -0.201     0.316     -0.822    0.422
Matrix (harsh)                                          -0.622     0.359     -1.300    0.106
Fragment size:Matrix (intermediate)                      0.048     0.030      0.010    0.107
Fragment size:Matrix (harsh)                             0.101     0.033      0.038    0.166


a-d, Each model includes two-way interactions between fragment size and taxon group (a), region (b), time since fragmentation (c) and matrix quality (d). Estimate is the mean of the posterior
distribution; est.error is the s.d. of the posterior distribution; and Q2.5 and Q97.5 are the 2.5% and 97.5% percentiles, respectively, of the posterior distribution.
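
The four summary columns defined in this footnote can be computed from a vector of posterior draws as sketched below. This is only an illustration: the function names are hypothetical, the draws are assumed to come from whatever sampler fitted the model, and the percentile interpolation rule (Python's "inclusive" method) is an assumption that may differ slightly from the software used for the tables.

```python
import statistics


def summarize_posterior(draws):
    """Return the posterior mean, s.d. and 2.5%/97.5% percentiles of `draws`."""
    # n=40 yields cut points at 2.5%, 5%, ..., 97.5%; keep the first and last.
    cuts = statistics.quantiles(draws, n=40, method="inclusive")
    return {
        "estimate": statistics.mean(draws),
        "est_error": statistics.stdev(draws),
        "q2.5": cuts[0],
        "q97.5": cuts[-1],
    }


def interval_excludes_zero(summary):
    """True when the 95% credible interval lies entirely on one side of zero."""
    return summary["q2.5"] > 0 or summary["q97.5"] < 0


# Hypothetical draws, used only to show the call pattern.
example = summarize_posterior([0.02, 0.05, 0.07, 0.08, 0.11])
print(example, interval_excludes_zero(example))
```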


Extended Data Table 2 | Models of the standardized number of individuals (N_std)


a. Model: taxon × fragment size interaction
Parameter                                               Estimate  Est.Error  Q2.5     Q97.5
Intercept                                                3.130     0.404      2.363    3.939
Fragment size (log scale)                                0.050     0.041     -0.034    0.132
Taxa (birds)                                             1.563     0.561      0.454    2.652
Taxa (invertebrates)                                     1.865     0.446      0.992    2.725
Taxa (mammals)                                           0.280     0.513     -0.767    1.284
Taxa (plants)                                            2.332     0.549      1.244    3.410
Fragment size:taxa (birds)                              -0.048     0.056     -0.157    0.063
Fragment size:taxa (invertebrates)                      -0.005     0.045     -0.095    0.086
Fragment size:taxa (mammals)                             0.028     0.051     -0.070    0.128
Fragment size:taxa (plants)                             -0.012     0.060     -0.132    0.104


b. Model: region × fragment size interaction
Parameter                                               Estimate  Est.Error  Q2.5     Q97.5
Intercept                                                4.897     0.461      4.014    5.831
Fragment size (log scale)                               -0.024     0.042     -0.107    0.058
Region (Asia)                                           -0.218     0.619     -1.447    1.009
Region (Central America)                                 0.178     0.662     -1.094    1.502
Region (Europe)                                         -0.624     0.635     -1.884    0.614
Region (North America)                                  -1.857     0.888     -3.603    0.161
Region (Oceania)                                        -0.498     0.779     -2.094    0.998
Region (South America)                                  -0.459     0.523     -1.511    0.552
Fragment size:Region (Asia)                              0.078     0.051     -0.018    0.179
Fragment size:Region (Central America)                   0.058     0.060     -0.059    0.176
Fragment size:Region (Europe)                            0.009     0.054     -0.099    0.116
Fragment size:Region (North America)                     0.206     0.076      0.052    0.348
Fragment size:Region (Oceania)                           0.124     0.066     -0.005    0.254
Fragment size:Region (South America)                     0.081     0.048      0.014    0.175


c. Model: time since fragmentation × fragment size interaction
Parameter                                               Estimate  Est.Error  Q2.5     Q97.5
Intercept                                                4.360     0.414      3.556    5.160
Fragment size (log scale)                                0.079     0.038      0.004    0.150
Time since fragmentation (20-100 years)                  0.295     0.454     -0.581    1.191
Time since fragmentation (> 100 years)                   0.114     0.523     -1.122    0.936
Fragment size:time since fragmentation (20-100 years)   -0.038     0.042     -0.118    0.046
Fragment size:time since fragmentation (> 100 years)    -0.054     0.046     -0.144    0.037


d. Model: matrix quality × fragment size interaction
Parameter                                               Estimate  Est.Error  Q2.5     Q97.5
Intercept                                                5.828     0.508      4.846    6.826
Fragment size (log scale)                               -0.068     0.049     -0.166    0.029
Matrix (intermediate)                                   -1.381     0.546     -2.453   -0.313
Matrix (harsh)                                          -1.662     0.622     -2.865   -0.429
Fragment size:Matrix (intermediate)                      0.115     0.051      0.017    0.214
Fragment size:Matrix (harsh)                             0.134     0.056      0.022    0.242


a-d, Each model includes two-way interactions between fragment size and taxon group (a), region (b), time since fragmentation (c) and matrix quality (d). Estimate is the mean of the posterior
distribution; est.error is the s.d. of the posterior distribution; and Q2.5 and Q97.5 are the 2.5% and 97.5% percentiles, respectively, of the posterior distribution.
Extended Data Table 3 | Models of standardized evenness (S_PIE)


a. Model: taxon × fragment size interaction
Parameter                                               Estimate  Est.Error  Q2.5     Q97.5
Intercept                                                1.226     0.179      0.874    1.577
Fragment size (log scale)                                0.063     0.025      0.013    0.112
Taxa (birds)                                             1.177     0.250      0.695    1.680
Taxa (invertebrates)                                     0.607     0.200      0.208    0.997
Taxa (mammals)                                           0.044     0.227     -0.404    0.484
Taxa (plants)                                            0.799     0.249      0.317    1.293
Fragment size:taxa (birds)                               0.008     0.033     -0.055    0.073
Fragment size:taxa (invertebrates)                      -0.030     0.028     -0.084    0.025
Fragment size:taxa (mammals)                             0.007     0.032     -0.057    0.071
Fragment size:taxa (plants)                             -0.005     0.036     -0.075    0.067


b. Model: region × fragment size interaction
Parameter                                               Estimate  Est.Error  Q2.5     Q97.5
Intercept                                                2.056     0.218      1.629    2.487
Fragment size (log scale)                                0.090     0.025      0.041    0.140
Region (Asia)                                           -0.389     0.271     -0.915    0.147
Region (Central America)                                -0.252     0.312     -0.881    0.342
Region (Europe)                                         -0.208     0.290     -0.782    0.373
Region (North America)                                  -0.501     0.406     -1.297    0.286
Region (Oceania)                                        -0.075     0.362     -0.786    0.627
Region (South America)                                  -0.411     0.247     -0.916    0.068
Fragment size:Region (Asia)                             -0.035     0.030     -0.094    0.026
Fragment size:Region (Central America)                  -0.065     0.037     -0.135    0.009
Fragment size:Region (Europe)                           -0.071     0.032     -0.135   -0.009
Fragment size:Region (North America)                    -0.025     0.047     -0.119    0.069
Fragment size:Region (Oceania)                          -0.036     0.041     -0.112    0.045
Fragment size:Region (South America)                    -0.037     0.029     -0.092    0.021


c. Model: time since fragmentation × fragment size interaction
Parameter                                               Estimate  Est.Error  Q2.5     Q97.5
Intercept                                                1.736     0.187      1.372    2.105
Fragment size (log scale)                                0.073     0.023      0.027    0.118
Time since fragmentation (20-100 years)                  0.032     0.207     -0.385    0.430
Time since fragmentation (> 100 years)                   0.042     0.234     -0.483    0.410
Fragment size:time since fragmentation (20-100 years)   -0.010     0.025     -0.059    0.040
Fragment size:time since fragmentation (> 100 years)    -0.065     0.028     -0.121    0.009


d. Model: matrix quality × fragment size interaction
Parameter                                               Estimate  Est.Error  Q2.5     Q97.5
Intercept                                                1.687     0.223      1.260    2.123
Fragment size (log scale)                                0.034     0.030     -0.025    0.094
Matrix (intermediate)                                    0.169     0.235     -0.278    0.619
Matrix (harsh)                                          -0.228     0.273     -0.772    0.306
Fragment size:Matrix (intermediate)                      0.004     0.032     -0.060    0.067
Fragment size:Matrix (harsh)                             0.063     0.036     -0.008    0.132


a-d, Each model includes two-way interactions between fragment size and taxon group (a), region (b), time since fragmentation (c) and matrix quality (d). Estimate is the mean of the posterior
distribution; est.error is the s.d. of the posterior distribution; and Q2.5 and Q97.5 are the 2.5% and 97.5% percentiles, respectively, of the posterior distribution.


nature research


Last updated by author(s): YYYY-MM-DD


Reporting Summary 


Nature Research wishes to improve the reproducibility of the work that we publish. This form provides structure for consistency and transparency 
in reporting. For further information on Nature Research policies, see our Editorial Policies and the Editorial Policy Checklist. 


Statistics 
The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement

A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly

The statistical test(s) used AND whether they are one- or two-sided

A description of all covariates tested
Data collection Data were collected from a literature search presented in Chase et al. 2019 (https://doi.org/10.1002/ecy.2861). For that data publication, we
used a number of computational resources in R to scrape, clean and organize data. All code is presented in Chase et al. 2019 as well as in the
code deposited for this study at: https://github.com/FelixMay/FragFrame_1 and mirrored at Zenodo https://doi.org/10.5281/zenodo.3862409


Data analysis All code for analyses is deposited at: https://github.com/FelixMay/FragFrame_1 and mirrored at Zenodo
https://doi.org/10.5281/zenodo.3862409

All data were collected from a literature search and are available in Chase et al. 2019 (https://doi.org/10.1002/ecy.2861).

Ecological, evolutionary & environmental sciences study design


All studies must disclose on these points even when the disclosure is negative. 


Study description This study compiled data from 123 studies examining habitat fragmentation and its influence on effort-controlled patterns of 
multivariate biodiversity metrics. It tested the primary hypothesis of passive sampling versus ecosystem decay in driving biodiversity 
patterns. 

Research sample Each study contained a site x species matrix, together with sampling and site meta-data. That is, for each habitat fragment in each
study, we compiled the abundance of each species encountered in a given sample. For many samples, a considerable amount of
effort was placed on standardizing effort so that fair comparisons could be made.


Sampling strategy As this was a meta-analysis, studies differed widely in their sampling methodology. This included counts of trees, sampling quadrats 
and transects, traps (for flying and crawling organisms, etc). They also differed considerably in the methodology for standardizing (or 
not standardizing) effort within each study. The key for each study was that we could quantify effort within each fragment and use 
that to compare sample-effort standardized patterns of diversity across sites. 


Data collection Again, as this was a meta-analysis of more than 100 studies of multiple groups of organisms, many hundreds of people were involved
in the data collection. The data compilation, however, was led by Chase, along with help from co-authors and two technicians in
Chase's lab group (Alban Saquois and Mario Liebergesell). All of this is recorded in the data paper developed expressly for this
purpose.


Timing and spatial scale As a meta-analysis, the time period ranged from datasets collected from the 1970s until 2019; most, however, took place between
1990 and 2019. The spatial scale and sampling effort varied considerably between studies, but in all cases, sampling effort was carefully
controlled so that standardization could take place.


Data exclusions No data were excluded from the analyses.


Reproducibility This was a compilation of mostly observational studies. Data collection was sometimes repeated over multiple years and
seasons, but this was highly variable among studies. We have provided the raw data and explicit code for how we dealt with data.


Randomization As these were primarily observational data from natural forest fragments, randomization at the treatment level was not possible.
However, for most studies, sampling within fragments was randomized with respect to location.


Blinding Again, data come from hundreds of studies. In some, blinding was possible, but in most it was not, as the observers would clearly know which
treatment they were in when data were collected.
DNase I hypersensitive sites (DHSs) are generic markers of regulatory DNA and
contain genetic variations associated with diseases and phenotypic traits. We
created high-resolution maps of DHSs from 733 human biosamples encompassing
438 cell and tissue types and states, and integrated these to delineate and numerically
index approximately 3.6 million DHSs within the human genome sequence, providing
a common coordinate system for regulatory DNA. Here we show that these maps
highly resolve the cis-regulatory compartment of the human genome, which encodes
unexpectedly diverse cell- and tissue-selective regulatory programs at very high
density. These programs can be captured comprehensively by a simple vocabulary
that enables the assignment to each DHS of a regulatory barcode that encapsulates its
tissue manifestations, and global annotation of protein-coding and non-coding RNA
genes in a manner orthogonal to gene expression. Finally, we show that sharply
resolved DHSs markedly enhance the genetic association and heritability signals of
diseases and traits. Rather than being confined to a small number of distal elements or
promoters, we find that genetic signals converge on congruently regulated sets of
DHSs that decorate entire gene bodies. Together, our results create a universal,
extensible coordinate system and vocabulary for human regulatory DNA marked by
DHSs, and provide a new global perspective on the architecture of human gene
regulation.


A fundamental challenge in modern biology is to delineate with the 
highest possible precision the repertoire of regulatory DNA elements 
encoded within the human genome sequence. A universal feature of 
active cis-regulatory elements—promoters, enhancers, silencers, chro- 
matin insulators or enhancer blockers, and locus control regions—is 
focal alteration in chromatin structure triggered by binding of tran- 
scription factors (TFs), which supplants a canonical nucleosome and 
renders the underlying DNA accessible to nucleases and other protein 
factors. For more than 40 years, DHSs have provided reliable
signposts for high-precision delineation of regulatory DNA in complex
genomes. DHSs typically mark compact (less than 250 base pair (bp))
elements, and their appearance over a cis-regulatory region signifies 
its actuation (readying for activation), which may occur before, or 
coincident with, its functional activation. DHS mapping thus provides 
a generic tool for illuminating both active and potential regulatory 
landscapes. 

The advent of genome-scale mapping of DHSs and its application
to diverse human and mouse cell and tissue types has yielded
many insights into the organization, evolution, activity, and
function of human regulatory DNA in both normal and malignant
states. A cardinal property of regulatory DNA is that its accessibility
is cell type- and state-selective, with only a small fraction of
all genome-encoded elements becoming actuated in a given cellular
context.

The overwhelming majority of disease- and trait-associated variants
identified by genome-wide association studies (GWASs) lie in
non-coding regions of the genome, and these variants are most strongly
enriched in DHSs mapped in disease-relevant cell contexts. DHSs also
collectively contain the GWAS variants that account for the majority
of trait heritability explained by genotyped single-nucleotide
polymorphisms (SNPs). Deeper insights into the connection between
GWAS variants and gene regulation have been limited by the lack of 
comprehensive annotations that capture the biological behaviour of 
regulatory DNA. 

As genome-scale data from diverse cellular contexts have accumulated,
systematic annotation of cell type- and state-selective DHSs has
grown increasingly challenging, and it has also become evident that
large sets of DHSs distributed widely across the genome may share
common regulatory programs. However, the annotation and analysis
of state-selective behaviours has been hampered by the lack of a
common coordinate system for DHSs.

Here we sought to expand the breadth of high-quality DHS maps, and
to unify them into a common reference framework that achieves precise
genomic annotation by integrating observed biological variability
in the manifestation of accessibility at individual elements, and that
captures complex cell-selective behaviours in a quantitative fashion.
We report a coherent framework and demonstrate its utility for the
annotation of human regulatory DNA and gene landscapes; for defining
how regulatory programs are encoded within the genome; and for
clarifying links between genetic signals and gene regulation to enable
new insights into the organization and interpretation of non-coding
variation associated with diseases and traits.

Fig. 1 | Index of DHSs in the human genome. a, DNA accessibility assayed
across multiple biosamples (indicated) from the main human organ systems.
Of 733 biosamples, 531 were derived from primary cells and tissues. b, Example
locus on chromosome 1, showing DNase I cleavage density in haematopoietic
biosamples (right) with cell type-selective differences. c, Outline of DHS index
procedure; 76.5 million DHSs aggregated across individual datasets jointly
delineate and annotate 3.59 million consensus DHSs. d, Examples of consensus
DHSs with varying cell-type selectivity and genome positional stability.
Annotations include consensus DHS coordinates (start/end), single-base
‘centroid’, ‘core’ region aggregating centroids across biosamples, and a unique
numerical identifier. e, Number of organ systems across which DHSs are
shared.


Index of consensus human DHSs


To create deeply sampled reference maps of human regulatory DNA
marked by DHSs, we performed DNase I hypersensitive site sequencing
(DNase-seq) on a wide range of human cell and tissue biosamples
that span all major human organ systems (Fig. 1a). Reference-grade
data were created by rigorous quality screening for complex libraries
yielding high signal-to-noise ratio data (Methods), and were aggregated
with prior high-quality data from the ENCODE and Roadmap
Epigenomics projects. We conservatively selected 733 biosamples that
represent 438 cell or tissue types and states (Fig. 1a, Supplementary
Table 1, Methods), the majority of which were derived from primary ex
vivo cells and tissues (72% of samples) or from primary cells in culture
(11%), with the remainder (17%) from immortalized cell lines. Collectively
these samples represent an approximately 5.5-fold expansion of
sampled cell and tissue types and states relative to the previous phase
of ENCODE (Extended Data Fig. 1a), and the resulting data reveal rich
and varied patterns of DNase I hypersensitivity (Fig. 1b).


Common coordinates for regulatory DNA 


We sought to create a precise and durable common reference framework
for genomic elements that encode DHSs by (i) comprehensively
and stringently (0.1% false discovery rate (FDR)) delineating DHSs
within each biosample; (ii) integrating individual biosample DHS
maps to define archetypal DHS-encoding sequence elements within
the human genome sequence; and (iii) assigning to each archetypal
element a unique numerical identifier (Fig. 1c).

We identified an average of 104,433 DHSs per biosample, and a total
of 76,549,656 DHSs across all 733 biosamples. To delineate archetypal
DHS-encoding genomic elements, we developed the consensus
approach outlined in Fig. 1c and Extended Data Fig. 1b, c. First, we
computed the summit coordinate (1 bp) of each DHS peak and aligned
these across all biosamples to define a consensus centroid for each
archetypal DHS. To resolve DHS boundaries, we collated the local linear
extent of DNase I hypersensitivity into a consensus range (Methods).
We then combined centroids and boundaries into a single index of
3,591,898 spatially distinct DHS-encoding sequence elements, greatly
eclipsing both the number (approximately 2.5-fold) and precision of
DHSs delineated during earlier phases of ENCODE (Extended Data
Fig. 1d). In addition to a consensus summit (centroid) and start and
end coordinates, each archetypal DHS comprises a ‘core’
region that represents empirical confidence bounds on the centroid
(Fig. 1d, Extended Data Fig. 1e). Each archetypal DHS derives from an
average of 21 biosamples, and because each DHS from a given biosample
contributes to a single archetypal DHS, the provenance of each DHS in
the index can be directly traced back to its contributing biosample(s).
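
To make the idea of aligning per-biosample summits into consensus centroids concrete, the toy sketch below groups nearby summits on a single chromosome and keeps track of which biosamples contributed to each group. It is an illustration only and not the consensus procedure described in the Methods: the gap-based grouping rule, the `max_gap` parameter, the use of the median and all names are assumptions.

```python
from statistics import median


def consensus_centroids(summits, max_gap=20):
    """Group (biosample_id, summit_position) pairs from one chromosome.

    Summits are sorted by position; a new group starts whenever the gap to
    the previous summit exceeds `max_gap` bp (a hypothetical threshold).
    """
    clusters = []
    for sample, pos in sorted(summits, key=lambda item: item[1]):
        if clusters and pos - clusters[-1][-1][1] <= max_gap:
            clusters[-1].append((sample, pos))
        else:
            clusters.append([(sample, pos)])

    index = []
    for cluster in clusters:
        positions = [pos for _, pos in cluster]
        index.append({
            "centroid": int(median(positions)),   # consensus summit
            "n_peaks": len(cluster),               # contributing DHS peaks
            "biosamples": sorted({sample for sample, _ in cluster}),
        })
    return index


peaks = [("a", 100), ("b", 104), ("c", 102), ("a", 500)]
for element in consensus_centroids(peaks):
    print(element)
```

Keeping the list of contributing biosamples with each element mirrors the provenance property described above, in which every index DHS can be traced back to its source biosamples.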

Finally, we assigned a unique identifier to each archetypal index DHS
using a numerical schema (Fig. 1d) that (i) conveys the genomic
localization of each DHS; (ii) enables unlimited extension to newly discovered
elements; (iii) ensures compatibility with future reference genome
builds and portability to personal genomes; and (iv) enables direct
integration with DNase I footprints or other experimental annotations
(Methods). We also assigned confidence scores to all index DHSs that
combine signal strength with propensity for repeated observation in
independent biosamples (Extended Data Fig. 1f, g).

Index DHSs are broadly distributed across annotated genic and repet- 
itive elements (Extended Data Fig. 2a-e). Fifty-three per cent of DHSs 
lie within introns, about 3% within non-coding exons and untranslated 
regions (UTRs), and about 2% are dually encoded within protein-coding 
exons (Extended Data Fig. 2c). Although DHSs are pronounced at anno- 
tated transcription start sites (TSSs), most localize to regions away 
from TSSs (Extended Data Fig. 2d, e). Fifty-four per cent of index DHSs 
overlap repetitive elements of all classes and subfamilies (Extended 
Data Fig. 2b), consistent with previous observations, although overlap
with DHS cores (41%) and centroids (37%) suggests that a more focused 
subset of DHSs derives regulatory machinery from repeats. 


Proportion of the genome that encodes DHSs 


The full extent of the DHS landscape should define, or at least closely
approximate, the canonical cis-regulatory compartment of the
genome, the size of which has been the subject of considerable debate.
The roughly 3.6 million consensus DHSs have an average width of 204 bp
(median 196 bp, interquartile range (IQR) 151-240 bp) and collectively
span 665.57 Mb (21.55%) of the reference human genome sequence. DHS
cores have an average width of 55 bp (median 38 bp) (Extended Data
Fig. 2f) and span 197.74 Mb (6.4%) of the genome. DHS centroids also
precisely mark the peak in evolutionarily conserved nucleotides within
DHSs, and the corresponding trough in the average density of human
genetic variants (Extended Data Fig. 2g), which are discontinuously
distributed between TF-occupied and unoccupied subsegments of
DHSs. Iterative subsampling of the 733 biosamples showed that the
addition of any given new biosample would be expected to contribute
about 1,676 new DHSs to the index (median 283, range 4-64,054, 95%
confidence interval 1,344-2,009) (Extended Data Fig. 2h and Methods).
Extrapolating from this, adding a further biosample
collection of equivalent size (n = 733) would be expected to increase
annotated DHS elements by about 27% (Extended Data Fig. 2h). Notably,
increasing biosamples should increase the precision of annotation and
thus the resolution of some broader elements (such as Extended Data
Fig. 1c, second DHS from the right) to two or more distinct archetypal
DHSs; however, such elements are in the minority.
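
A genome-coverage figure of the kind quoted above (bases spanned, and the fraction of the genome they represent) is the length of the union of a set of intervals. The sketch below shows one generic way to compute it; it is not the authors' pipeline, it assumes half-open (start, end) intervals on a single chromosome, and the function names and example numbers are hypothetical.

```python
def covered_bases(intervals):
    """Total bases covered by half-open (start, end) intervals,
    counting overlapping or nested stretches only once."""
    total = 0
    cur_start = cur_end = None
    for start, end in sorted(intervals):
        if cur_end is None or start > cur_end:
            # The previous merged block is finished; add its length.
            if cur_end is not None:
                total += cur_end - cur_start
            cur_start, cur_end = start, end
        else:
            # Overlaps or touches the current block; extend it.
            cur_end = max(cur_end, end)
    if cur_end is not None:
        total += cur_end - cur_start
    return total


def covered_fraction(intervals, sequence_length):
    """Covered bases as a fraction of a sequence of the given length."""
    return covered_bases(intervals) / sequence_length


example = [(0, 10), (5, 15), (20, 25)]
print(covered_bases(example), covered_fraction(example, 100))
```

For a whole genome, the same calculation would be repeated per chromosome and the totals summed before dividing by the genome length.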


Cellular patterning of DNA accessibility 


DHSs are extensively shared across both individual biosamples and
groups of biosamples from different organ systems (Fig. 1e, Extended
Data Fig. 2i). It was previously reported that groups of widely distributed
DHSs with closely shared cross-cell-type actuation patterns also shared
biological functions such as enhancer activity. Patterns of index DHS
actuation across the 733 biosamples (Fig. 2a) were complex, with both
highly modular and less coherent structures (Fig. 2b). The majority of
DHSs showed complex actuation patterns rather than simple cell-selective
behaviour (Fig. 1e, Extended Data Fig. 2i), prompting us to develop a
flexible approach for quantifying and annotating these patterns.

In principle, the actuation of any given index DHS across cell states 
can be summarized by a limited number of biological ‘components’ 
combined in a weighted fashion. Orthogonally, the same components
can be used to summarize the DHS repertoire of an individual biosa- 
mple. Because DHS-centric information can inform biosamples and 
vice versa, a key advantage of this approach is its potential to capture 
complex behaviours while providing biological interpretability. 


A vocabulary for regulatory patterns


To simplify the matrix of 3,591,898 DHSs x 733 biosamples we applied 
non-negative matrix factorization (NMF) (Extended Data Fig. 3a-d),
a technique that was initially used in the field of computer vision for
learning parts-based representations of objects and semantic features
of text. We represented each DHS by a large enough number of
components (k = 16) to ensure accuracy (that is, the degree to which the
original matrix can be reconstructed from the components) while
retaining potential for interpretability via assignment of components 
to established biological contexts such as known cell lineage relation- 
ships, or cell states known to be specified by specific regulatory factors 
(Fig. 2c, Extended Data Fig. 3e-g, Methods). 

To connect components with biological contexts, we identified the 
biosamples that were most strongly associated with each component, 
and the distribution of TF recognition sequences within DHSs that was 
most strongly associated with that component. For all components, 
the top contributing cell or tissue samples were notably coherent, 
enabling provisional assignment of a meaningful biological label to 
most components (Extended Data Fig. 4a-d, Methods). Enrichment
of TF recognition sequences within the DHSs that were most strongly 
associated with each component revealed clear mappings between 
distinct sets of cell lineage- or state-specifying TFs and specific 
components (Extended Data Fig. 4e, f, Methods), orthogonal to the 
biosample-to-component mappings described above. Finally, we com- 
bined biosample-to-component mappings and TF-to-component map- 
pings to create a regulatory ‘vocabulary’ that captures the actuation 
pattern of a DHS across cell types and states (Fig. 2d, Supplementary 
Note). Notably, the interpretation of components does not change with 
further induced model sparsity (Extended Data Fig. 5a-c) or changes
in cell type representation (Extended Data Fig. 5d). 


Biological annotation of individual DHSs 


We next sought to annotate each DHS with a regulatory ‘barcode’ 
that captures its tissue manifestations. The cross-biosample actua- 
tion pattern of each DHS is captured by linear combinations of NMF 
components (Fig. 2c, Extended Data Fig. 3a-c), providing a de facto
barcode of its biological spectrum (Fig. 2c, e). DHSs that are selective
for a single cell type or state are annotated by a single majority
component (Fig. 2e, columns 1, 4, 5, 7-9); DHSs that occur in multiple
cellular contexts are described by combinations of components (Fig. 2e,
columns 2, 3, 6, 10); and constitutive DHSs are annotated by mixtures
of all components (Fig. 2e, column 10), including a component that
describes tissue-invariant behaviour. In this schema, DHSs with similar
cross-biosample actuation patterns exhibit similar mixtures of
components. For analytical practicality and visual compactness, the
annotation of each DHS can be further summarized using its strongest
single component (Fig. 2e, bottom); we use this summary vocabulary
for the analyses described below.


Dense encoding of regulatory information

The above results indicate that DHSs have the potential for surprisingly
diverse biological regulatory patterns that combine coordinated
positive (actuation) and negative (quiescent) behaviours. As the
overwhelming majority of DHSs fall into a tight size range that stays roughly
constant with increasing numbers of biosamples and does not vary
with the complexity of component barcodes (Extended Data Fig. 2f),
archetypal DHS elements must therefore encode regulatory information
with extremely high linear density.


Fig. 2 | A simple vocabulary captures complex patterning of DHSs. a, DNA
accessibility at 3.59 million consensus DHSs assayed across 733 biosamples
encapsulated in a visually compressed DHS-by-biosample matrix. Recurring
accessibility patterns indicate extensive sharing across cell contexts. Dark
column (right) shows DHSs detected in (nearly) all datasets. b, Modular
behaviour of DHS actuation illustrated by thousands of DHSs with similar
cross-biosample accessibility patterns. c, Decomposition of DHS actuation
patterns across 733 biosamples into 16 components using NMF. The cellular
patterning of each DHS is described using a mixture of components, indicated
by distinct colours. d, DHS component labels provide a regulatory vocabulary
for DHSs. e, Component mixtures for ten example DHSs with varying degrees
of component specificity. The biosample dataset most strongly associated
with each component is shown. Bottom, annotation of individual DHSs with a
single dominant component.


Regulatory annotation of human genes 


The function of many genes is closely connected to their regulated 
expression across cells and tissues, and hence to the activity spectra 
of their cognate regulatory elements. We found that DHSs with simi- 
lar component annotations were highly clustered along the genome 
(Extended Data Fig. 6a, b, Methods), particularly over gene bodies and 
their immediate flanking regions (Extended Data Fig. 6c), which col- 
lectively capture 65% of all DHSs (Extended Data Fig. 6d-g). We thus 
reasoned that integration of the components of DHSs overlying a given 
gene could be used to annotate its likely functional compartment(s). 
The existence of coordinately regulated DHSs in gene bodies cannot 
be ascribed to transcriptional activity per se, which produces only 
very minor changes in the general DNase I sensitivity baseline. Quan- 
tification of the enrichment of congruently annotated DHSs around 
56,832 GENCODE genes (protein-coding and non-coding) genome-wide 
revealed 20,658 genes (5% FDR) with significant clustering along the 
genome of DHSs that belonged to the same component (Fig. 3a-d, 
Supplementary Table 2). Notably, the gene body-centric approach to 
annotation captured 70% more genes, and more genes that are likely to 
be biologically significant, than a TSS-centric approach (Extended Data 
Fig. 6h, i). Only a subset of gene body DHSs contributed to component
assignments (median, 38%; IQR, 26-54%), consistent with the fact that
many distal regulatory elements localize within the bodies of genes 
other than the ones that they regulate. 

Of 20,291 GENCODE protein-coding genes, more than half (54.1%) 
could be assigned a regulatory component based on their overlying 
DHSs (Fig. 3b). To determine whether these assignments were con- 
cordant with other functional annotations, we assessed (i) whether the 
genes most confidently annotated by a given DHS component reflected
their known function(s), and (ii) whether genes annotated with a par- 
ticular component are maximally expressed in cell types that match 
or are closely related to those components. The top genes annotated 
by the lymphoid component are all involved in immune response and 
disease (Fig. 3e, Extended Data Fig. 7a). Similar relationships were 
observed for other categories of gene (Fig. 3e, Extended Data Fig. 7b-d), 
including those annotated by the myeloid/erythroid component (eryth- 
ropoiesis or haematopoietic stem cell genes), a stromal component 
(collagen genes and fibronectin), and the tissue-invariant component 
(housekeeping genes). This phenomenon was particularly notable for 
TF-encoding genes such as lineage-specifying master regulators
of cardiac development (cardiac component; Fig. 3f, Extended Data
Fig. 7f) or the development of other organ systems (Extended Data
Fig. 7e-h).

per component. Top five results for all protein-coding genes (e) and TF genes
subset (f), for selected components. g, Correspondence between regulatory
annotation and RNA expression shown using relative transcriptional activity
across a panel of component-matched tissues and cell types (log2 observed/
expected ratios). h, Putative TF-dependent regulatory elements defined by
DHSs exclusively sharing regulatory components with genes encoding a given
TF that also contain an occupied (footprinted) cognate TF motif.

To explore the concordance between DHS vocabulary annotations 
and gene expression across cell states, we investigated an
independently generated compendium of more than 100,000 uniformly
processed RNA sequencing (RNA-seq) datasets. After matching DHS
components with tissue-relevant expression datasets (Methods), we 
found strong correspondence between the vocabulary-based annota- 
tion of genes and the cell or tissue types in which they were maximally 
expressed (Fig. 3g). In many instances, DHS vocabulary annotation 
and gene expression offered different but complementary views of 
gene characteristics. For example, the transferrin receptor (TFRC) is 
responsible for cellular iron uptake and is required for erythropoiesis. 
TFRC RNA is most highly expressed in tissues from the placental
component (Extended Data Fig. 7i), where TFRC is known to be involved in
trophoblast membranes. From the perspective of regulation, however, 
TFRC is the most strongly associated gene in the myeloid/erythroid 
component (Fig. 3e, Extended Data Fig. 7b), in line with its core iron 
transport functionality. Analogously, the gene for HNF4G, a TF that is 
crucial for liver development, is the most strongly associated gene in 


the digestive component (Fig. 3e, Extended Data Fig. 7g); however, its 
RNA is expressed most highly in cells and tissues encompassed under 
the renal component (Extended Data Fig. 7i). DHS-centric annota- 
tions thus provide an orthogonal yet complementary view relative 
to expression data alone by providing a window into regulation. As 
DHSs greatly outnumber genes, DHS landscapes are in principle more 
information-rich than gene expression data alone. 


Annotating genes with unknown functions 


Despite intensive study, the function of many human genes remains 
obscure, particularly for those that are expressed at low levels or that 
have highly cell-selective expression patterns, for example zinc-finger
(ZNF) TFs or long non-coding RNA genes. Nearly half of ZNF TFs
(43.7%) could be annotated with a DHS component (Extended Data
Fig. 8a), indicating their likely biological sphere of activity. Among long
non-coding RNA genes, 38.7% could be mapped to DHS components
(Extended Data Fig. 8b), as could 18% of pseudogenes (Extended
Data Fig. 8c), which might reflect remnants of regulatory states that
existed before ancient gene duplications. Beyond genes, we reasoned
that entire pathways could be annotated using the DHS landscapes
of their constituent genes (Extended Data Fig. 8d). For instance, the
Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway ‘allograft
rejection’ (a paradigmatic immune response) is strongly enriched for 
the lymphoid component (Extended Data Fig. 8e), consistent with the 
concept that genes involved in similar biological processes should 
share similar patterns of regulatory element activity. 


Connecting DHS actuation to specific TFs 


We reasoned that the coalescence of congruently annotated DHSs 
and genes, plus the availability of high-quality motif databases and 
newly available DNase I footprinting data, could enable the
systematic discovery of regulatory regions for which actuation patterns were
likely to be driven, at least in part, by particular TFs. We identified 454
TFs with known sequence recognition motifs for which the encoding
genes were annotated by a DHS component. We next identified 189,318
DHSs genome-wide (per TF median 149, IQR 47-477 DHSs) that (i) were
exclusively annotated by a component matching that of the TF gene,
and (ii) showed occupancy of the cognate motif by footprinting in
a component-matched biosample (Fig. 3h). Such DHSs are likely to
be highly functionally dependent on their associated TF, and should
provide a rich substrate for experimental manipulations to investigate
connections between TFs and regulatory functions. 


Annotating genetic association signals 


We next investigated whether DHS annotations could expand insights 
into the role(s) of genetic variation in regulatory DNA, and thus provide 
a more meaningful framework for interpreting the pathophysiological
basis of disease and trait associations. A rank-based analysis of disease 
or trait against DHS component associations (explicitly controlling for 
large scale linkage disequilibrium (LD) structure; Methods) revealed 
increasingly strong component-specific enrichments of association 
signals across diverse traits (Fig. 4a, Extended Data Fig. 9a, b). In many
cases these enrichments exceeded those obtained by considering only 
DHSs detected in biosamples most closely related to the relevant DHS 
component (for example, lymphoid cell biosamples versus lymphoid 
component; Fig. 4a, Extended Data Figs. 4a, 9c). 

Quantifying the extent to which DHS annotations captured 
SNP-based trait heritability (h_g^2) (Fig. 4b) revealed a strong increase
in heritability enrichment for trait-relevant DHS components (Fig. 4b,
coloured bars) relative to all index DHSs (Fig. 4b, grey bars) or to a large
panel of 85 baseline annotations (Fig. 4b, white bars; top three annota- 
tions shown). Heritability was markedly enriched specifically within 


DHS ‘core’ regions, providing orthogonal evidence for the delineation 
and importance of this subcompartment (Fig. 4b). 

To generalize these observations, we compiled more than 1,300 traits 
with SNP-based heritability of at least 1% from the UK Biobank project
and from curated published data. Of these, 261 diseases and traits
showed highly significant component-specific enrichment in herit- 
ability, particularly for pathophysiologically relevant DHS components 
(Fig. 4c, Extended Data Fig. 9d; 1% FDR). Restricting DHS delineations to 
‘core’ regions again yielded significantly greater enrichment compared 
to full DHSs (Extended Data Fig. 9e, f). 

To remove potentially confounding contributions from multiple 
genomic annotations that overlap the same SNP (for example, a DHS 
that overlaps a coding region of a gene annotated with a particular DHS 
component), we quantified the statistical significance of DHS compo- 
nent heritability contributions while controlling for the contribution of 
all other annotations (Methods). For virtually all reported traits, DHS 
component annotations significantly (P < 0.01) captured SNP-based
trait heritability (Fig. 4d, black line). 

We next performed cell type-specific heritability analyses to
quantify the concentration of trait-associated genetic signals in DHSs 
annotated by specific DHS components, relative to the full repertoire 
of DHSs mapped in disease- or trait-relevant cell types (Methods). 
Component-annotated DHSs produced significant improvements in 
capturing trait heritability compared to individual biosample maps 
(P<2.2 x10; Fig. 4d, grey solid line). At the level of specific traits, in 
68 out of 261 cases (26%), DHS component annotations captured trait 
heritability better than individual DNase-seq datasets (Fig. 4e). We 
conclude that the current index of highly resolved consensus DHSs 
markedly sharpens disease and trait association and heritability signals. 


Genetic signals span gene body DHSs 

The observed clustering of concordantly regulated DHSs along gene 
bodies (Fig. 3) led us to speculate that such DHSs were more likely than 
other DHSs to contain relevant genetic signals. To test this idea, we 
quantified trait heritability separately for component-concordant 
DHSs (17% of DHSs) and component-discordant DHSs (34%) within gene 
bodies (Fig. 4f). Concordant DHSs strongly contributed to SNP-based 
trait heritability relative to DHSs that were found in the same genes 
but with component annotations discordant with the annotations of 
the underlying gene, despite having lower average DNase-seq signal 
levels (Extended Data Fig. 9g) and more specialized utilization patterns 
(occurring in an average of 15 versus 25 biosamples). DHSs that were 
proximal to genes not labelled by any DHS component showed the 
weakest heritability contributions, and intergenic DHSs contributed 
only modestly (Fig. 4f, Extended Data Fig. 9h). Rather than being
confined to a small number of distal elements or promoters, it thus appears
that genetic association signals are concentrated within congruently 
regulated sets of DHSs that decorate entire gene bodies. 


Discussion 


Here we have presented the most comprehensive and precise map of 
human DHSs, and a common coordinate system and vocabulary for
regulatory DNA, creating a framework for global analyses of tissue-specific
gene regulation and its intersection with human disease trait genetics. 
Regulation across cell types and states is a cardinal property of DHSs 
that is now captured in DHS components, greatly expanding the
analytical horizon beyond cell type-agnostic annotations such as chromatin
states. Common reference coordinates should additionally facilitate
comparisons between large experimental datasets, and between human
and mouse DHSs, which can now be directly linked in a manner that is
robust to future mouse assemblies.

Given the scale of the data, it is natural to ask how complete and stable 
our current maps are. New biosamples will add new DHSs and annotate
existing elements with ever higher precision. Adding 733 biosamples
of equivalent biological breadth would increase the number of
consensus DHSs by an average of 27%, with rapidly diminishing returns
after that. From the current 21.55% it also is reasonable to predict that
no more than 28% of the extant human reference sequence encodes
cis-regulatory modalities that give rise to DHSs.

It should now be possible to triangulate the genetics-to-gene-regulation
interface along three axes: (i) a genomic position axis,
which is now finely resolved to consensus DHS summits (centroids);
(ii) a cell/tissue-state axis now captured in DHS components; and (iii) a
gene context axis that reflects the coherent co-localization of similarly
regulated DHSs over gene bodies. The convergence of GWAS variants
in coordinately regulated gene body DHSs suggests a fundamental
feature of the genetic architecture of disease that has heretofore, to
our knowledge, escaped notice. This finding resulted from combining
the sharpened disease association and heritability signals enabled by
high-precision annotation of regulatory DNA with the new ability to
annotate the biological spectrum of each element, neither of which
would have been possible without the large advances in biological scale
and methodologies reported here. The fact that genetic association and
heritability signals are concentrated across congruently regulated sets
of DHSs that decorate entire gene bodies has important theoretical and
practical implications for understanding both the genetic architecture
of disease and the problem of connecting genetic signals with their
target genes, which is critical for therapeutic translation.

Fig. 4 | DHS components illuminate genetic associations and heritability.
a, Association of DHSs with GWAS traits by component, shown as enrichment
ratios for increasingly stringent subsets of variants (canonical genome-wide
significance threshold of 5 × 10^-8 indicated). Grey, enrichments for top 15
component-associated biosamples. b, Stratified LD-score regression (S-LDSC)
for traits shown in a associates GWAS variants and DHS components.
Heritability enrichment for the top three most enriched baseline annotations
(white); the full DHS index (grey); and trait-relevant DHS components (red).
*Statistically significant enrichment (one-sided test, 1% FDR). c, Enrichment of
DHS component (x-axis) heritability across 261 GWAS traits (y-axis). Greyscale
indicates heritability enrichment levels for statistically significant associations
(one-sided tests, 1% FDR). Right, sampling of labels of enriched traits for each
component. Arrows, traits from a and b. d, Distribution of S-LDSC coefficient
z-scores across 261 GWAS traits, shown for all baseline annotations (dashed
grey line), top 15 DHS component-associated biosamples (solid grey line) and
DHS components (black line). e, S-LDSC coefficient z-scores for selected traits
(lupus, q = 0.002; maximum heart rate during fitness, q = 0.016; alcohol
drinking status, q = 0.009), shown for all biosamples (grey lines), top 15
component-associated biosamples (coloured ticks) and DHS components
(coloured arrows). f, Stronger heritability contribution of
component-concordant DHSs shown by stratifying S-LDSC z-scores by DHS types. Boxes,
medians and IQRs (25-75%); whiskers, 1.5 × IQRs; n = 261 GWAS traits. Grey areas
in d-f indicate S-LDSC z-scores (S-LDSC coefficients, normalized using
estimated standard errors) with P < 0.01; FDR-corrected q-values shown for
traits in e.

More broadly, the framework we report represents a transition 
from an exploratory era focused on the discovery of novel elements, 
to a map-centric era with a focus on the detection of previously
annotated elements within specific biological contexts (Extended Data
Fig. 10a). The index framework may also obviate the need for peak
calling (Extended Data Fig. 10b-e), and should prove particularly
valuable for anchoring single-cell studies, which are presently at least
1,000-fold too sparse for robust delineation of regulatory DNA within 
individual cells.

Methods


Generation of DNase I hypersensitivity maps 

DNase I assays were generally performed according to a protocol
detailed previously. This protocol involves treatment of intact nuclei
with the small enzyme DNase I, which is able to penetrate the nuclear
pore and cleave exposed DNA. Small (<1 kb) fragments are isolated 
from lysed nuclei following DNase I treatment, linkers are added, and 
the resulting library is sequenced. Because tissue and cell culture, iso- 
lation, and handling protocols differ for different biosamples, these 
are indexed in Supplementary Table 1. Additional information on the 
procurement of biosample material and DNase-seq biosample selec- 
tion and data processing is available in the Supplementary Methods. 


Index of consensus human DHSs 

DHSs were detected in individual biosample datasets and integrated 
across all 733 datasets to yield a set of 3.59 million consensus DHS 
delineations. These elements were subsequently annotated with esti- 
mates of their centre-of-mass, positional stability across datasets and 
confidence scores. A detailed explanation of this procedure is provided 
in the Supplementary Methods. 


Overlap of the DHS index with genomic annotations. To assess 
the overlap of our DHS consensus elements with repetitive elements 
(Extended Data Fig. 2b), we obtained RepeatMasker annotations
downloaded from the University of California Santa Cruz (UCSC)
Table Browser, and considered the various repeat classes and
(sub)families as provided. To perform analogous analyses for human gene
annotations (Extended Data Fig. 2c), we obtained GENCODE v.28
Basic annotations. We defined exons as specified in the GENCODE
annotation, promoters as the TSS of genes ±1 kb, and introns as the rest
of the gene body. Intergenic regions were defined as those not covered
by gene bodies or defined promoters. We assigned index DHSs to these
annotations requiring at least a 1 bp overlap, choosing the annotation
with the largest overlap in case of multiple overlapping annotations. 
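
The largest-overlap rule described above can be sketched as follows. This is an illustrative toy, not the pipeline used for the index: the function names, the half-open coordinate convention, the tie-breaking (first annotation wins) and the example intervals are all assumptions.

```python
def overlap(a_start, a_end, b_start, b_end):
    """Length in bp of the overlap between two half-open intervals."""
    return max(0, min(a_end, b_end) - max(a_start, b_start))

def assign_annotation(dhs, annotations, min_overlap=1):
    """Return the label of the annotation that overlaps the DHS most.

    dhs: (start, end); annotations: list of (start, end, label) on one chromosome.
    Falls back to 'intergenic' when nothing overlaps by at least min_overlap bp.
    """
    best_label, best_bp = "intergenic", 0
    for start, end, label in annotations:
        bp = overlap(dhs[0], dhs[1], start, end)
        if bp >= min_overlap and bp > best_bp:
            best_label, best_bp = label, bp
    return best_label

# Hypothetical toy annotations.
annotations = [(1000, 3000, "promoter"), (3000, 3200, "exon"), (3200, 9000, "intron")]
print(assign_annotation((2950, 3150), annotations))    # 50 bp promoter vs 150 bp exon
print(assign_annotation((20000, 20200), annotations))  # no overlap at all
```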

TOPMed within-human sequence variation data were obtained 
from the Bravo website (https://bravo.sph.umich.edu/freeze5/hg38/ 
download, Freeze 5, hg38, VCF format). We converted 495.6 million 
single-base substitutions to nucleotide diversity scores (π), with a
score of zero implied for every genomic base position with no variants.
Per base, phyloP sequence conservation scores were downloaded
as-is (http://ngdownload.cse.ucsc.edu/goldenpath/hg38/phyloP- 
100way/). Within-human sequence variation data (7 x 10*) and phyloP 
conservation scores were aligned relative to DHS centroids using 20-bp 
non-overlapping windows tiled across a 1-kb region centred on each 
centroid (Extended Data Fig. 2g). For each window offset relative to 
the DHS centroid, genome-wide per-base scores were subsetted using 
bedops” and averaged with GNU datamash. 
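
As a rough illustration of the windowing scheme, the sketch below averages per-base scores in 20-bp non-overlapping windows across a 1-kb region centred on a single centroid. It is a pure-Python stand-in rather than the bedops and datamash workflow described above; the function name, the dictionary input and the toy data are assumptions.

```python
def window_means(scores, centroid, region=1000, window=20):
    """Average per-base scores in non-overlapping windows centred on a centroid.

    scores: dict mapping genomic position -> score; missing positions count as 0,
    mirroring the zero score implied for positions without variants.
    Returns a list of (offset_of_window_start, mean_score).
    """
    half = region // 2
    out = []
    for offset in range(-half, half, window):
        vals = [scores.get(centroid + offset + i, 0.0) for i in range(window)]
        out.append((offset, sum(vals) / window))
    return out

# Toy example: a single scored base located exactly at the centroid.
profile = window_means({5000: 2.0}, centroid=5000)
print(len(profile))   # number of windows across the 1-kb region
print(profile[25])    # the window that starts at the centroid
```

For a genome-wide profile, the per-window means would then be averaged across all DHS centroids at each offset.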


Saturation and extendability of DHS index. For random subsamples of 
sizes ranging from 1 to 733 biosamples, we estimated the mean number 
of novel DHSs added by a new dataset as a function of total number of 
datasets sampled (Extended Data Fig. 2h). To extrapolate these esti- 
mates to future biosample sets, we fitted a log-log model to the data. 
From the saturation analysis, we expect the overwhelming majority 
of DHSs identified in any new dataset to be represented already in the
index, to which they will contribute additional confidence and preci- 
sion. Incremental datasets can be added to the index by re-delineating 
DHSs using the original per-dataset DHS calls permanently recorded 
at the ENCODE DCC (Supplementary Table 1). 
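
A log-log fit of this kind can be sketched with ordinary least squares on log-transformed values. The data below are synthetic and chosen only to make the example checkable; the function names and the exact model form (a straight line in log-log space) are assumptions, and the details of the actual fit are in the Supplementary Methods.

```python
import math

def fit_loglog(n_datasets, new_dhs):
    """Least-squares fit of log(new_dhs) = a + b * log(n_datasets)."""
    xs = [math.log(n) for n in n_datasets]
    ys = [math.log(y) for y in new_dhs]
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    b = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sum((x - mx) ** 2 for x in xs)
    a = my - b * mx
    return a, b

def predict(a, b, n):
    """Expected number of novel DHSs contributed by the n-th dataset."""
    return math.exp(a + b * math.log(n))

# Synthetic saturation curve (NOT the paper's data): y = 50000 * n^-0.5.
ns = [1, 10, 50, 100, 300, 733]
ys = [50000 * n ** -0.5 for n in ns]
a, b = fit_loglog(ns, ys)

# Extrapolate: total novel DHSs expected from hypothetical datasets 734..1466.
extra = sum(predict(a, b, n) for n in range(734, 1467))
print(round(b, 3), round(extra))
```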


Construction of a DHS vocabulary 

We used NMF for the decomposition of a binary matrix consisting
of the presence or absence calls of m DHSs across n DNase-seq
datasets into a smaller set of k components. As with other dimensionality


reduction methods, NMF does not guarantee a total recapitulation 
of the original data; instead we chose to allow information loss in 
exchange for a more interpretable result. Therefore, we considered 
using a much smaller number of k components than the lower of the 
two dimensions of our input matrix (733 DNase-seq datasets). To keep 
the reconstruction error in check, we used an objective function that 
is minimized subject to the Frobenius norm (Extended Data Fig. 3a). 
NMF typically uses a random initialization step, leading to unstable 
results. To alleviate this, we performed the initialization step using 
singular value decomposition (SVD), leading to consistent results
while maintaining a performance that is on par with randomly initialized
instances. A more detailed rationale for the component-wise descrip- 
tion of DHSs, as well as details on the implementation and execution 
of the decomposition, is provided in the Supplementary Methods. 
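
To make the idea concrete, here is a minimal NumPy sketch of NMF with a deterministic SVD-based start and a Frobenius-norm objective. It is not the implementation described in the Supplementary Methods: the initialization simply takes absolute values of the truncated SVD factors (a simplification of SVD-based schemes), the updates are the standard Lee-Seung multiplicative rules, and the matrix orientation (DHSs in rows, biosamples in columns) and the toy data are assumptions.

```python
import numpy as np

def nmf_svd_init(V, k, n_iter=200, eps=1e-9):
    """Factorize a non-negative matrix V (m x n) as W @ H with k components.

    Initialization: absolute values of the truncated SVD factors (simplified,
    deterministic). Updates: multiplicative rules for the Frobenius objective.
    """
    U, s, Vt = np.linalg.svd(V, full_matrices=False)
    W = np.abs(U[:, :k] * np.sqrt(s[:k])) + eps
    H = np.abs(np.sqrt(s[:k])[:, None] * Vt[:k, :]) + eps
    errors = []
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
        errors.append(np.linalg.norm(V - W @ H))  # Frobenius reconstruction error
    return W, H, errors

# Toy binary presence/absence matrix: 3 blocks of DHSs, each open in its own biosamples.
rng = np.random.default_rng(0)
V = np.zeros((60, 12))
for blk in range(3):
    V[blk * 20:(blk + 1) * 20, blk * 4:(blk + 1) * 4] = 1.0
V = np.abs(V - (rng.random(V.shape) < 0.05))  # flip about 5% of entries as noise

W, H, errors = nmf_svd_init(V, k=3)
print(W.shape, H.shape, round(errors[0], 3), round(errors[-1], 3))
```

In this orientation, each row of W gives the component loadings of one DHS and each column of H gives the loadings of one biosample. Because the start is deterministic, repeated runs on the same matrix give the same factors.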


Labelling of NMF components and DHSs. To aid interpretation of the 
16 NMF-derived components, we used two orthogonal approaches to 
assign labels to components, based on (i) biosample properties and 
(ii) DHS sequence features. 

First, for each component we selected the top biosamples based on 
component-specific NMF loadings present in their datasets (Extended 
Data Fig. 4a). These maximal NMF loadings across datasets were gen- 
erally strong across components (Extended Data Fig. 4b). In general, 
a clear pattern emerged of shared properties of biosamples most 
strongly associated with specific components. To formalize this, we 
performed one-sided Mann-Whitney U tests to assess whether NMF 
loadings for biosamples sharing certain metadata categories (Sup- 
plementary Table 1) are greater than those for biosamples not in the 
given metadata category (Extended Data Fig. 4c). In particular, we 
assessed metadata categories corresponding to human organ systems 
and the cancer status of biosamples. P values were corrected for
multiple hypothesis testing using the Bonferroni correction method. A post
hoc analysis of biosample-to-component assignment for values of k <
16 provided insight into the genesis of our k = 16 component model,
showing junctures after which separate cell type lineages are captured 
by distinct components (Extended Data Fig. 4d). 
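
The metadata test can be illustrated with a small standard-library sketch of a one-sided Mann-Whitney U test followed by Bonferroni correction. It uses the normal approximation without tie or continuity correction, which is a simplification; the loadings and the extra P values are invented for illustration.

```python
import math

def mann_whitney_greater(x, y):
    """One-sided Mann-Whitney U test (H1: values in x tend to exceed those in y).

    Normal approximation, no tie or continuity correction (a simplification).
    Returns (U statistic for x, upper-tail P value).
    """
    u = sum((xi > yj) + 0.5 * (xi == yj) for xi in x for yj in y)
    n1, n2 = len(x), len(y)
    mean = n1 * n2 / 2
    sd = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
    z = (u - mean) / sd
    p = 0.5 * math.erfc(z / math.sqrt(2))  # P(Z >= z) for a standard normal
    return u, p

def bonferroni(p_values):
    """Multiply each P value by the number of tests, capped at 1."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]

# Hypothetical NMF loadings for one component: biosamples in a category vs all others.
in_category = [0.9, 0.8, 0.85, 0.7, 0.95, 0.75]
others = [0.1, 0.2, 0.05, 0.3, 0.15, 0.25, 0.0, 0.12]
u, p = mann_whitney_greater(in_category, others)

# Correct across three hypothetical category tests.
adjusted = bonferroni([p, 0.5, 0.04])
print(u, p, adjusted)
```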

Second, for each component we obtained DHSs with maximal NMF 
loadings for that component, and subsequently performed enrichment 
analyses for TF binding site motifs (Extended Data Fig. 4e). We used a 
wide array of TF motifs and used FIMO™ (match threshold P< 10°), to 
search for motif instances in the human genome. We tested the asso- 
ciation of motif occurrences with specific NMF components using 
Fisher’s exact test. We used clusters of similar motifs (http://www. 
mauranolab.org/CATO/weblogos/main.html) for the purpose of sum- 
marization and visualization. The results show strong enrichments 
for component-specific motifs, suggesting preferential binding of 
component-relevant transcription factors (Extended Data Fig. 4f). 
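
For a single motif and component, the association test reduces to a 2x2 table. The sketch below computes a one-sided (enrichment) Fisher's exact P value from the hypergeometric distribution; the choice of a one-sided test, the table layout and the example counts are assumptions, since the text does not specify them.

```python
from math import comb

def fisher_exact_greater(a, b, c, d):
    """One-sided Fisher's exact test for enrichment in a 2x2 table.

                        motif   no motif
    in component          a        b
    not in component      c        d

    Returns P(X >= a) under the hypergeometric null with fixed margins.
    """
    row1, col1, n = a + b, a + c, a + b + c + d
    upper = min(row1, col1)
    tail = sum(comb(col1, k) * comb(n - col1, row1 - k) for k in range(a, upper + 1))
    return tail / comb(n, row1)

# Small hand-checkable table: (C(4,3)*C(4,1) + C(4,4)*C(4,0)) / C(8,4) = 17/70.
print(fisher_exact_greater(3, 1, 1, 3))

# Hypothetical motif counts: 40 of 100 component DHSs vs 100 of 900 other DHSs.
print(fisher_exact_greater(40, 60, 100, 800))
```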

The strong associations of 1) biosample properties and 2) TF bind- 
ing site occurrences with specific components enabled us to label 
each NMF component, resulting in a DHS vocabulary (Fig. 2d), further
detailed in the Supplementary Note. For downstream analyses, we 
labelled each DHS with its strongest NMF component (Fig. 2e, bottom). 
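
Labelling a DHS by its strongest component amounts to an argmax over its loadings. A minimal sketch, with hypothetical labels and loadings and an assumed tie-breaking rule (the first maximum wins):

```python
def barcode(loadings):
    """Normalize a DHS's non-negative component loadings so they sum to 1."""
    total = sum(loadings)
    return [x / total for x in loadings] if total > 0 else [0.0] * len(loadings)

def dominant_component(loadings, labels):
    """Label of the component with the largest loading (first one wins ties)."""
    idx = max(range(len(loadings)), key=lambda i: loadings[i])
    return labels[idx]

# Hypothetical labels and loadings for three of the 16 components.
labels = ["lymphoid", "cardiac", "tissue invariant"]
loadings = [0.1, 1.5, 0.4]
mixture = barcode(loadings)
print(mixture, dominant_component(loadings, labels))
```

The normalized mixture corresponds to the full barcode, while the dominant label is the compact summary used for downstream analyses.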


Robustness of component interpretation. To test the effect of induc- 
ing additional sparsity in the NMF model, we systematically increased 
the L1 penalization setting while tracking F1 scores and the fraction of
non-zero parameters used in the model (Extended Data Fig. 5a-c). The 
top 15 component-contributing biosamples per component remain 
mostly consistent with Fig. 2e and Extended Data Fig. 4a without L1 
penalization, indicating that enforcing additional sparsity does not 
impact the interpretation of model components. 

To test the effect of possible over/under-representation of certain 
cell types, we removed 44 (40%) haematopoietic biosamples, consisting
of the highest quality datasets representative of unique cellular
conditions (Supplementary Table 1). After building a new NMF model, we


observe that although the remaining (lower quality) haematopoietic 
biosamples are now captured by a single component instead of two, 
the interpretation of the remaining non-haematopoietic components 
does not change (Extended Data Fig. 5d). 


Regulatory annotation of human genes 

Per-component genomic distribution of DHSs. We compared the 
average distance between same-component DHSs against empirical 
distributions based on random assignment of component labels to 
DHSs and sampling the same number of DHSs 1,000 times (Extended 
Data Fig. 6a). 
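
The permutation scheme can be sketched as below. The distance statistic used here (mean nearest-neighbour distance among DHSs of one component on a single chromosome) is an assumption, as the text does not define "average distance" precisely; function names and toy coordinates are likewise illustrative.

```python
import random

def mean_nn_distance(positions):
    """Mean distance from each position to its nearest neighbour in the set."""
    pos = sorted(positions)
    dists = []
    for i, p in enumerate(pos):
        left = p - pos[i - 1] if i > 0 else float("inf")
        right = pos[i + 1] - p if i < len(pos) - 1 else float("inf")
        dists.append(min(left, right))
    return sum(dists) / len(dists)

def clustering_p_value(all_positions, component_positions, n_perm=1000, seed=0):
    """Empirical P value that same-component DHSs are closer than random labels imply."""
    rng = random.Random(seed)
    observed = mean_nn_distance(component_positions)
    k = len(component_positions)
    hits = sum(
        mean_nn_distance(rng.sample(all_positions, k)) <= observed
        for _ in range(n_perm)
    )
    return (hits + 1) / (n_perm + 1)

# Toy chromosome: 1,000 DHSs spaced 200 bp apart; 30 adjacent DHSs share a component.
all_positions = list(range(0, 200000, 200))
component_positions = all_positions[100:130]
p_value = clustering_p_value(all_positions, component_positions)
print(p_value)
```

Drawing the same number of DHSs at random is equivalent to randomly reassigning the component label among all DHSs.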


Per-component meta-DNase tracks. To illustrate the regional di- 
versity of DHS component data, we generated meta-DNase tracks 
representing each of the 16 DHS components (Extended Data Fig. 6b) 
by averaging genome-wide DNase-seq signal profiles of the top 15 bi- 
osamples most strongly associated with each component (Extended 
Data Fig. 4a). For visual conciseness, we provide aggregate tracks that 
overlay the meta-DNase tracks of all DHS components (for example, 
Fig. 3c, Extended Data Figs. 6b, c, 7a-h, 8a-c). 


Definition of regulatory landscapes. We defined the regulatory land- 
scape of a gene as the set of DHSs within the gene body, plus DHSs 
in flanking regions of maximally 5 kb upstream and maximally 1 kb 
downstream of the gene body, or up until halfway through to the gene 
upstream, whichever value is smaller (Fig. 3a, Extended Data Fig. 6e-g). 
This captures approximately 65% of all DHSs (Extended Data Fig. 6d) 
and prevents flanking region DHSs from being routinely assigned to the 
regulatory landscapes of multiple genes, alleviating mixing of regula- 
tory signals. 
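
The landscape definition can be sketched as a small interval calculation. This is a hedged example, not the pipeline code: it assumes that the halfway cap applies to whichever neighbouring gene lies on each side, and the function and parameter names are hypothetical.

```python
def regulatory_landscape(start, end, strand, left_neighbour_end=None,
                         right_neighbour_start=None,
                         max_upstream=5_000, max_downstream=1_000):
    """Return (lo, hi) genomic bounds of a gene's regulatory landscape."""
    # Upstream lies to the left for '+' genes and to the right for '-' genes.
    if strand == "+":
        left_max, right_max = max_upstream, max_downstream
    else:
        left_max, right_max = max_downstream, max_upstream
    left_flank, right_flank = left_max, right_max
    if left_neighbour_end is not None:
        gap = max(0, start - left_neighbour_end)
        left_flank = min(left_max, gap // 2)     # stop halfway to the neighbour
    if right_neighbour_start is not None:
        gap = max(0, right_neighbour_start - end)
        right_flank = min(right_max, gap // 2)
    return max(0, start - left_flank), end + right_flank

def dhss_in_landscape(dhs_midpoints, bounds):
    """DHS midpoints that fall inside the landscape bounds (inclusive)."""
    lo, hi = bounds
    return [m for m in dhs_midpoints if lo <= m <= hi]

# A '+' strand gene whose upstream neighbour ends 4 kb away: the upstream
# flank is capped at 2 kb, the downstream flank keeps its 1 kb maximum.
bounds = regulatory_landscape(100_000, 120_000, "+", left_neighbour_end=96_000)
```

Capping each flank at the midpoint between neighbouring genes is what keeps a flanking DHS from being assigned to two landscapes at once.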


Association of genes with DHS components. We tested the associa- 
tion of all 56,832 annotated GENCODE genes (Fig. 3b) with each DHS 
component separately. Under the null hypothesis that DHS compo- 
nents are randomly distributed across gene regulatory landscapes, 
we used the binomial distribution to test whether the proportion of 
DHSs annotated with a given component is higher among DHSs within 
a particular gene regulatory landscape than outside. We controlled 
the FDR at 5% by calculating q values across the total of all genes and 
components. Further details are provided in the Supplementary Methods. 
To study the differences between a gene-centric and TSS-centric 
approach, we calculated component associations for 10-kb regions 
centred around the TSS (that is, TSS ± 5 kb) and assessed the number 
and type of genes annotated (Extended Data Fig. 6h, i). 
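
A minimal sketch of such a test, under stated assumptions: the background proportion is taken from DHSs outside the landscape, the test is the upper tail of the binomial distribution, and the Benjamini-Hochberg adjustment is used here as a simple stand-in for the q-value calculation. Function names are hypothetical.

```python
from math import comb

def binom_sf(k, n, p):
    """P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p)**(n - i) for i in range(k, n + 1))

def bh_adjust(pvalues):
    """Benjamini-Hochberg adjusted P values, returned in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    for rank in range(m, 0, -1):           # walk from largest to smallest P
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted

# Toy example: two landscapes of 20 DHSs each, tested against a component
# that labels 10% of the DHSs outside each landscape.
p_enriched = binom_sf(12, 20, 0.1)   # 12 of 20 DHSs carry the component
p_typical = binom_sf(2, 20, 0.1)     # 2 of 20 DHSs carry the component
adjusted = bh_adjust([p_enriched, p_typical])
```

Adjusting across the pooled list of all gene-component pairs, rather than per component, mirrors the statement that the FDR is controlled across the total of all genes and components.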


Annotations for GENCODE genes and pseudo-gene types. GENCODE 
v.28 (Basic) annotations were used for all analyses. For the purpose 
of labelling and visualizing genes, for each gene we used its longest 
transcript as its representative region. Pseudo-gene annotations were 
obtained from psiCube, http://pseudogene.org/psicube/data/gencode.v10.pgene.parents.txt. 


Visualization of gene regulatory annotations. We used t-SNE to visu- 
alize the enrichment ratios of gene regulatory landscapes for DHS 
components (Fig. 3d, Extended Data Fig. 8a-c). Each dot shown repre- 
sents a gene found to be significantly associated with one or more DHS 
components, and the union of these are the genes used to calculate the 
2D embedding. The R (http://www.r-project.org) implementation as 
provided in the Rtsne package was used, with default parameters. Genes 
are coloured according to their (most strongly enriched) significant 
DHS component. 


Construction and use of gene expression compendium dataset. 
We used the full human ARCHS4 dataset (downloaded 26 June 2018)” 
and selected relevant tissue and cell types for each DHS vocabulary 


component (Supplementary Methods). This resulted in a total of 33,733 
unique gene expression datasets, with expression information for 
35,238 genes. For each gene, we obtained the 95th percentile value 
across datasets selected for each DHS component as the represent- 
ative value in that component, to not be led by outliers in the data, 
while still being sensitive for cell type selective expression levels. For 
each DHS component, we calculated average expression levels across 
genes labelled with that component (observed), as well as across all 
component-labelled genes (expected). Resulting values are reported 
as log2-transformed enrichment ratios (Fig. 3g). 
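
The observed/expected calculation can be illustrated as follows. This is a sketch under assumptions: expression values are held in plain dictionaries, the 95th percentile uses linear interpolation, and all names are hypothetical.

```python
from math import log2
from statistics import mean, quantiles

def percentile95(values):
    """95th percentile with linear interpolation; needs at least two values."""
    return quantiles(values, n=20, method="inclusive")[-1]

def expression_enrichment(expr, gene_labels, component):
    """log2 ratio of observed to expected expression for one component.

    expr: {gene: [expression in each dataset selected for `component`]}
    gene_labels: {gene: component label} for all component-labelled genes
    """
    representative = {g: percentile95(expr[g]) for g in gene_labels}
    observed = mean(v for g, v in representative.items()
                    if gene_labels[g] == component)
    expected = mean(representative.values())
    return log2(observed / expected)

# Toy data: one lymphoid-labelled gene that is highly expressed in the
# lymphoid datasets and two cardiac-labelled genes that are not.
expr = {"g1": [10.0] * 20, "g2": [1.0] * 20, "g3": [1.0] * 20}
labels = {"g1": "lymphoid", "g2": "cardiac", "g3": "cardiac"}
ratio = expression_enrichment(expr, labels, "lymphoid")
```

Using a high percentile rather than the maximum limits the influence of single outlier datasets while still responding to expression that is confined to a subset of cell types.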


Annotation and visualization of pathway labellings. A curated 
set of canonical pathways was obtained from the MSigDB Collections 
(http://software.broadinstitute.org/gsea/msigdb/genesets.jsp?collection=CP). 
Pathway enrichment analyses (Extended Data 
Fig. 8d, e) were performed analogously to gene enrichment analy- 
ses, by pooling DHSs in neighbourhoods of all pathway-associated 
genes. We used the KEGG REST API (https://www.kegg.jp/kegg/rest/keggapi.html) 
to download and graphically annotate KEGG pathway 
representations. 


Prioritization of TF-associated DHSs. We obtained DHSs with load- 
ings for a single component only. For each component-labelled TF 
gene with a known sequence binding motif, we obtained the subset 
of DHSs that (i) are annotated with the same component as the TF, (ii) 
contain a TF-matching motif, and (iii) are footprinted in a biosample 
associated with the same component” (Fig. 3h). Although the above 
analysis identified a small minority of DHSs owing to stringent filter- 
ing, motifs with variable information content, and the smaller range 
of biosamples for which footprinting data are available, this approach 
could be recapitulated with less extreme parameters to identify larger 
sets of DHSs at reasonable confidence. 


Genetic variation analyses 

GWAS traits and summary statistics. We obtained GWAS summary 
statistics data from the UK Biobank project as processed by the Neale 
lab (http://www.nealelab.is/uk-biobank/). In addition, we obtained 
GWAS summary statistics calculated using BOLT-LMM v2.3™, as used 
in recent work®. 


Estimates of SNP-based heritability. GWAS traits were curated by 
removing those with a narrow-sense SNP-based heritability (h_g^2) of less 
than 1%. Although ideally we would quantify heritability by considering 
the true causal effects of variants, in reality we do not observe these. 
Instead, we are limited to GWAS summary statistics, which essentially 
describe the marginal trait-correlation for each variant, consisting of 
both causal effects and effects due to LD, plus statistical noise. Recently 
proposed methods such as LD score regression (LDSC)* are able to 
estimate heritability while explicitly considering the underlying LD 
structure. For continuous traits, in case both raw and inverse-ranked 
normalized (irnt) versions were available, we retained the latter only. 
This yielded a total of 1,316 traits for subsequent analyses with an h_g^2 
of at least 1%. 


Quantitative trait associations. For quantitative trait-versus- 
component analyses (Fig. 4a, Extended Data Fig. 9a-c), we assessed the 
correspondence between trait association strength (GWAS variant association 
P value) and the component annotations of variant-containing 
DHSs, for increasingly stringent subsets of GWAS variants. Enrichment 
P values were calculated using a binomial distribution, as done previously. 
We explicitly control for large-scale LD structure, using a form 
of LD clumping, by selecting a single variant-containing DHS for each 
of 1,708 approximately independent LD blocks. Namely, for each LD 
block, the variant with the lowest GWAS association P value that overlaps 
a DHS was selected for subsequent analysis. In case multiple such 
variant-containing DHSs existed, we gave preference to the DHS with 
the highest confidence score (mean signal) in our DHS index. 
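
The per-block selection can be sketched as below. The record layout (keys `block`, `p`, `dhs`, `dhs_score`) and the function name are hypothetical, and the sketch assumes that the DHS confidence score acts as a tie-breaker among variants with equal P values.

```python
def select_per_ld_block(variants):
    """Pick one variant-containing DHS per LD block.

    variants: iterable of dicts with keys 'block', 'p', 'dhs' and
    'dhs_score'; variants that overlap no DHS have dhs=None.
    """
    best = {}
    for v in variants:
        if v["dhs"] is None:
            continue                      # only DHS-overlapping variants count
        # Lower P value wins; ties go to the higher DHS confidence score.
        key = (v["p"], -v["dhs_score"])
        current = best.get(v["block"])
        if current is None or key < (current["p"], -current["dhs_score"]):
            best[v["block"]] = v
    return best

# Toy input: the strongest variant in block 1 overlaps no DHS, and two
# DHS-overlapping variants tie on P value.
variants = [
    {"block": 1, "p": 1e-9, "dhs": None, "dhs_score": 0.0},
    {"block": 1, "p": 1e-6, "dhs": "DHS_a", "dhs_score": 2.0},
    {"block": 1, "p": 1e-6, "dhs": "DHS_b", "dhs_score": 5.0},
    {"block": 2, "p": 0.2, "dhs": "DHS_c", "dhs_score": 1.0},
]
selected = select_per_ld_block(variants)
```

Keeping a single DHS per block means that each approximately independent LD block contributes at most one observation to the binomial enrichment test.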


Stratified LD-score regression. To estimate h_g^2 with maximal statistical 
power, we used LD score regression (LDSC) to explicitly take 
into account LD structure. In particular, we used a stratified version 
of LDSC (S-LDSC)** to partition heritability estimates according to 
pre-defined sets of genome-wide annotations (Fig. 4b, c, Extended 
Data Fig. 9d, e), consisting of our annotated DHSs in addition to a 
wide range of 85 genome-wide functional ‘baseline’ annotations 
(baseline-LD model v.2.1). The v.2.1 baseline set consists of a total 
of 86 genome-wide annotations, building upon the 76 annotations 
used in the v.2.0 set and several additional annotations®. These ‘base- 
line’ annotations encode whether SNPs fall inside protein-coding or 
non-coding regions, regions with increased levels of evolutionary 
conservation, regions predicted or confirmed to have enhancer activity, 
and so on. Their breadth provides a robust baseline model along 
which to test trait heritability contributions of our DHS components. 
We express the heritability enrichment of an annotation as the ratio 
of its proportion of per-trait h_g^2 and the proportion of SNPs covered 
by the annotation (Fig. 4b). 

Variants included in the analysis are those registered in HapMap3, 
with a minimal minor allele frequency (MAF) of 5%, and excluding the 
human major histocompatibility complex (MHC) locus. Baseline LD 
scores were computed from 1000 Genomes Phase 3 data from European 
ancestry populations and corresponding allele frequencies (as used 
previously® and available from the LDSC reference downloads page, 
along with the baselineLD annotation set: https://data.broadinstitute.org/alkesgroup/LDSCORE/). 


Heritability enrichments for DHS vocabulary components. We 
applied S-LDSC to our DHS vocabulary components as follows. In 
brief, each DHS was assigned to its majority DHS component and 
(when possible) assigned to overlapping variants. For the resulting 
vocabulary-based annotations, LD scores were calculated. We then 
performed S-LDSC separately for each of the selected 1,316 traits, 
relative to these vocabulary-based annotations and the baselineLD 
model described above. For each trait versus annotation combination, 
we obtained estimates of its heritability enrichment, expressed as 
the ratio of its proportion of h_g^2 and the proportion of SNPs covered 
by the annotation (Fig. 4b, c). We considered heritability enrichments 
statistically significant at an estimated FDR of less than 5% calcu- 
lated across all considered traits and DHS components. This is more 
stringent than the commonly used per-trait correction for multiple 
hypothesis testing. 


Unique per-annotation contributions to SNP-based heritability. Es- 
timates of heritability enrichment can be confounded by contributions 
of multiple (overlapping) genomic annotations included in S-LDSC 
models. To quantify unique per-annotation contributions to heritabil- 
ity, we obtained the average per-SNP increase in heritability ascribed to 
that component, after controlling for all other annotations in the model 
(baseline annotations and DHS components)**®. From the reported 
coefficients and their standard errors, we derived z-scores, one-sided 
P values and FDR-corrected q values for each trait-versus-component 
combination (Fig. 4d, e). For the heritability analysis in component 
concordant genic DHSs (Fig. 4f), we further stratified DHSs based on 
whether they are component concordant, component discordant, 
inside non-annotated genes (genic controls), or inter-genic. Figure 4f 
shows z-scores for the maximally enriched components identified in 
Fig. 4c. 
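
For illustration, the two summary quantities used in these analyses can be computed as follows. This sketch assumes that the coefficient and its standard error have already been read from the S-LDSC output and that the one-sided P value is the upper tail of a standard normal distribution; the function names are hypothetical.

```python
from math import erfc, sqrt

def heritability_enrichment(prop_h2, prop_snps):
    """Proportion of heritability divided by the proportion of SNPs covered."""
    return prop_h2 / prop_snps

def z_and_one_sided_p(coefficient, standard_error):
    """z-score and upper-tail P value under a standard normal null."""
    z = coefficient / standard_error
    p = 0.5 * erfc(z / sqrt(2))           # P(Z >= z)
    return z, p

# Toy numbers: an annotation covering 5% of SNPs that explains 30% of
# heritability, and a coefficient three standard errors above zero.
enrichment = heritability_enrichment(0.30, 0.05)
z, p = z_and_one_sided_p(3.0e-8, 1.0e-8)
```

The enrichment ratio describes how concentrated heritability is in an annotation, whereas the z-score asks whether the annotation adds signal beyond everything else in the model, which is why the two can disagree.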

To quantify the heritability contribution of per-dataset DHSs, we 
performed a variation on the standard S-LDSC procedure, as described 
previously”. Specifically, we built upon the baselineLD model by itera- 
tively considering annotations derived from individual datasets only. 


These individual datasets were collected by selecting for each trait 
the 15 datasets most informative to each DHS component (Extended 
Data Fig. 4a). Annotations consist of DHSs observed in those datasets, 
as well as their complement, that is, the remainder of index DHSs. We 
report the contribution to heritability based on the former, expressed 
as z-scores (Fig. 4d, e). 


Extendability of the DHS vocabulary 

Addition of novel unseen datasets. New datasets may be added to the 
current NMF model while retaining the same interpretation of compo- 
nents (Extended Data Fig. 10a). In brief, 0.1% FDR variable-width peak 
calls are obtained from new datasets of interest, mapped to DHS index 
elements using bedops” and projected into the existing component 
space using standard NMF routines (see code for more details). 
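
The projection step can be sketched with a multiplicative update in which the component-by-DHS matrix is held fixed and only the loadings of the new dataset are fitted. This is a generic NMF technique written in plain Python for small inputs, not the code from the repository; the matrix orientation and all names are assumptions of this example.

```python
def project_onto_components(v, H, n_iter=500, eps=1e-12):
    """Non-negative loadings w (length k) that approximately minimise ||v - wH||^2.

    v: DHS occupancy vector of the new dataset (length n_dhs)
    H: fixed component-by-DHS matrix (k rows, each of length n_dhs)
    """
    k = len(H)
    # v H^T (length k) and H H^T (k x k) do not change between iterations.
    vHt = [sum(vj * hj for vj, hj in zip(v, row)) for row in H]
    HHt = [[sum(a * b for a, b in zip(r1, r2)) for r2 in H] for r1 in H]
    w = [1.0] * k
    for _ in range(n_iter):
        wHHt = [sum(w[i] * HHt[i][j] for i in range(k)) for j in range(k)]
        # Multiplicative update keeps every loading non-negative.
        w = [w[j] * vHt[j] / (wHHt[j] + eps) for j in range(k)]
    return w

# Toy model with two components that use disjoint pairs of DHSs.
H = [[1.0, 1.0, 0.0, 0.0],
     [0.0, 0.0, 1.0, 1.0]]
w_new = project_onto_components([1.0, 1.0, 0.0, 0.0], H)
```

Because the component matrix is never updated, the interpretation of each component is unchanged by the addition of the new dataset.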


DHS index element identification without de novo peak identifica- 
tion. We used bedops“ to look up DNase-seq signal levels of a dataset 
of interest over index elements, to determine whether a given element 
is actuated in the dataset. Expressed as a classification problem, using 
the existing 0.1% FDR variable-width peak calls as the ground truth set, 
we assess precision and recall of peak recovery. For all 733 biosamples 
we find area under precision recall curve (AUPRC) values ranging from 
0.33 to 0.83 (median, 0.71; IQR, 0.64-0.75), with a trophoblast biosa- 
mple (ENCODE DCC identifier ENCBS576QRR) shown as an example 
(Extended Data Fig. 10a). The large difference between AUPRC values 
of matched versus non-matched biosamples allows the identification 
of the original biosample (Extended Data Fig. 10b), while showing that 
biosamples with similar AUPRC ranks share the same biological charac- 
teristics (Extended Data Fig. 10c). This procedure can also be followed 
for unseen datasets (Extended Data Fig. 10d), in particular datasets that 
are less deeply profiled or would otherwise be too sparse to call peaks 
on de novo, such as single-cell chromatin profiling data. 
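
A hedged sketch of the evaluation: each index element receives a signal level from the dataset of interest, the existing peak calls serve as labels, and the area under the precision-recall curve is estimated by average precision. The use of average precision as the estimator and the function name are assumptions of this example.

```python
def average_precision(scores, labels):
    """Average precision, a step-wise estimate of the area under the PR curve.

    scores: signal level per index element
    labels: 1 if the element has a peak call in the dataset, otherwise 0
    """
    n_pos = sum(labels)
    if n_pos == 0:
        raise ValueError("need at least one positive element")
    ranked = sorted(zip(scores, labels), key=lambda t: -t[0])
    true_positives = 0
    total = 0.0
    for rank, (_, is_peak) in enumerate(ranked, start=1):
        if is_peak:
            true_positives += 1
            total += true_positives / rank   # precision at this recall step
    return total / n_pos

# Toy example: four index elements, two of which have peak calls.
auprc = average_precision([0.9, 0.8, 0.7, 0.6], [1, 0, 1, 0])
```

A dataset whose signal ranks its own peak calls first gives a value of 1, so comparing the values obtained against matched and non-matched peak sets indicates how specific the signal is to a biosample.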


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


All primary data are available from the ENCODE DCC portal. Biosample 
metadata are available in Supplementary Table 1 as well as in other 
formats via Zenodo (https://doi.org/10.5281/zenodo.3838751). The set 
of more than 3.5 million DHS delineations is available in tab-separated 
format from the ENCODE DCC portal (https://www.encodeproject.org/annotations/ENCSR857UZV/) 
and via Zenodo (https://doi.org/10.5281/zenodo.3838751). Data matrices describing the occurrence 
patterns of DHSs across biosamples are available in various formats 
via Zenodo (https://doi.org/10.5281/zenodo.3838751). There are no 
restrictions on data availability and (re)use. We additionally provide 
a specialized data browser (https://index.altius.org/) and a trackhub 
for the UCSC Genome Browser (https://genome.ucsc.edu/cgi-bin/hgTracks?db=hg38&hubUrl=https://resources.altius.org/~meuleman/DHS_Index_tracks/hub.txt). 
BED files documenting the coordinates 
and annotations of DHSs with evidence of being bound by specific 
transcription factors are available via Zenodo (https://doi.org/10.5281/zenodo.3838751), 
and top-scoring elements per TF can be explored in 
a browser (https://index.altius.org/?application=viewer&roiSet=TFassoc_Meuleman). 


Code availability 


Code is available on GitHub for building the index of consensus DHSs 
(https://github.com/Altius/Index), and for constructing the DHS vocabulary 
and adding novel biosamples (https://github.com/Altius/Vocabulary). 
coordinates according to these clusters (step 4), and delineate DHSs using the number of datasets in which a DHS was observed. 


[Extended Data Fig. 2 graphic, panels a-i, for 3,591,898 DHSs. Axis labels: distance to closest TSS (d), distance to closest TSS (log10) (e), DHS (core) widths (f), position relative to DHS centroid (g). Annotations in h: 1st dataset, +104,062 DHSs; 733rd dataset, +1,676 DHSs; 1,466th dataset, +1,048 DHSs.] 


Extended Data Fig. 2 | Genomic context of DHS index elements. a, Overall 
coverage of 3.5M+ DHSs across genes and repetitive elements. b, Coverage of 
classes and families of repetitive elements. c, Coverage of annotated genic 
regions. d, Barplot of the number of DHSs as a function of distance to the 
nearest annotated transcription start site (TSS), up to 100,000 base pairs. 

e, Density plot of DHS distance to the closest TSS for all index DHSs, showing 
that the vast majority of DHSs are found distal to annotated promoters. 

f, Density plot of element widths for full DHSs and their core regions only, 
shown for DHSs observed in more than one biosample. Uniform 20 bp jitter 
added for smoothness. g, DHS centroids show an increase in sequence 
conservation (phyloP) and a decrease in within-human sequence variation 
(TOPMed, 7x 10*). h, Mean number of new DHSs observed as a function of the 
nth DNase-seq dataset added, shown for the first 733 observed biosamples, as 
well as for an extrapolation to an additional future 733 new biosamples. 

i, Histogram indicating the variety in cell type selectivity of DHSs, ranging from 
single cell types to groups of 10s, 100s or even all assayed cellular conditions. 
c, Colour-based view from the values shown in b. Colours indicate relative 
Extended Data Fig. 4 | Association of DHS components with cellular 
conditions and TF motifs. a, Bar plots showing for each NMF component the 
top 15 DNase-seq datasets in terms of NMF loadings. NMF loading strength 
(x-axes) and dataset labelling (y-axes) are indicated. b, Box plots showing for 
each NMF component its loadings across those biosamples for which that 
component is maximally loaded. Boxes denote medians and interquartile 
ranges (IQRs, 25-75%), whiskers represent 1.5 × IQRs, n = 18, 57, 46, 27, 52, 23, 34, 
49, 40, 107, 33, 27, 54, 40, 36, 90 biosamples, respectively. c, Beyond the top 15 
biosamples for each component, general associations of components with 
annotations regarding human organ systems and cancer. Indicated are 


Bonferroni corrected P values, resulting from one-sided Mann-Whitney U 
tests. d, Distribution of biosamples across (maximal) NMF components, for 
the number of components (k) ranging from 2 to 16. Labels at the top indicate 
at which point distinct lineages became represented in corresponding 
components. e, Enrichment of transcription factor (TF) binding motifs in DHS 
components. Greyscale values indicate enrichment levels, only statistically 
significant results are included. DHS components shown on the x-axis, TF motif 
clusters with top representative motif on the y-axis. f, Top enriched TF motifs 
for each DHS component. 


Article 


Extended Data Fig. 5 | DHS component robustness. a, F1 score as a function 
of L1 penalization levels (λ), with separately indicated levels of sparsity 
reflected by the percentage of non-zero parameters in the resulting models. 
Shaded area represents penalization levels resulting in comparable 
16-component models, as opposed to models with effectively fewer than 16 
components, which are discarded in subsequent analyses. b, All biosamples 
with non-zero NMF loadings in the cardiac DHS component (for λ = 0). 
Horizontal line separates the top 15 biosamples (yellow shading) from the rest 
(shades of green), where green shading indicates quantile ranking in terms of 


component loading strength. c, Biosamples with non-zero NMF loadings for 
each DHS component, extended with agreement of quantile ranking as a 
function of L1 penalization levels, indicating that these rankings stay near 
constant for most components. d, Top 15 biosamples in terms of NMF loading 
per DHS component for an alternative NMF model resulting from a 40% 
downsampling of high-quality haematopoietic biosamples. NMF loading 
strength (x-axes) and dataset labelling (y-axes) are indicated, only for 
components that differ with the final model. 
Extended Data Fig. 6 | Clustering of same-component DHSs near genes. 

a, Component-specific genomic clustering of DHSs, as shown by the median 
distance between same-component DHSs as compared to the median distance 
after random permutation of DHS-component labels. b, Regulatory landscape 
±50 kb around the GATA1 gene, indicating GENCODE gene annotations, 
meta-DNase tracks for individual DHS components (Methods), and a 
meta-DNase overlay track. c, Detailed view, restricted to the GATA1 regulatory 
landscape, including its delineated and annotated DHSs. Collectively, this 
landscape shows a statistical over-representation of DHSs associated with the 
myeloid/erythroid (red) component. d, Density plot of DHS distance to the 
closest TSS for all Index DHSs (black line) and the subset (65%) of DHSs 
considered for the purpose of annotating genes using DHS components. 


e, f, Alignment plots showing DHS summit density across the transcription 
start sites (TSSs, e) and transcription termination sites (TTSs, f) of annotated 
genes. Shaded areas indicate regions included for the purpose of annotating 
genes using DHS components. g, DHS density expressed in terms of number of 
DHSs per kilobase, indicating a general enrichment of DHSs in and immediately 
surrounding genes. h, Venn diagram showing the overlap between regulatory 
annotations based on the gene-centric approach described in this work and a 
TSS-centric approach (±5 kb). The gene-centric approach captures the vast 
majority of genes annotated using the TSS-centric approach, while adding an 
additional approximately 11,000 genes. i, Type of genes annotated using a 
gene-centric versus TSS-centric approach, showing the former yielding larger 
fractions of protein-coding and long non-coding genes. 
Extended Data Fig. 7 | Top labelled genes for selected components. 

a-d, Top-scoring protein-coding genes per DHS component reflect their 
functional roles, as shown for lymphoid (a), myeloid / erythroid (b), stromal (c) 
and tissue-invariant (d) components. e-h, Top-scoring transcription factor 
(TF) genes per DHS component reflect their functional roles, as shown for 
placental (e), cardiac (f), digestive (g) and organ developmental/renal (h) 


components. Full gene regulatory landscapes used for labelling are shown, 
with GENCODE gene annotations, meta-DNase overlay track, and DHSs. 

i, Examples of component-annotated genes with discordant expression 
patterns. Coloured squares next to gene names indicate relevant components, 
in this particular case discordant with cell and tissue types with maximal 
expression. 


Extended Data Fig. 8 | Annotation of genes with unknown function and 
pathways. a-c, Two-dimensional projection coordinates generated using 
t-SNE on all genes significantly associated with a DHS component and shown 
selectively for subsets of gene categories, namely transcription factors (TFs; 
diamonds: ZNF TF genes) (a), lincRNA genes (b) and pseudo-genes (c). 
Indicated are the number of labelled genes in each combination of gene 
category and DHS component. Examples of labelled genes are shown as 
follows. a, Regulatory landscape of ZNF331, a poorly annotated zinc-finger 
(ZNF) TF gene (lymphoid and placental components). b, Regulatory landscape 


of BANCR, a long intergenic non-coding RNA (lincRNA) gene, recently 
associated with cardiomyocyte migration. c, Regulatory landscape of the 
pseudo-gene IGHGP (lymphoid component). d, DHS component labelling of 
MSigDB canonical pathways, through the regulatory landscapes of constituent 
genes. Shown are pathways with a significant association (5% FDR) and 

an observed/expected ratio of at least 2. The most strongly associated 
components are indicated for each pathway, with their source databases. 

e, Examples of three component-associated pathways from the KEGG database, 
with genes coloured according to their majority component. 
Extended Data Fig. 9 | GWAS trait associations of DHS components. 

a-c, Quantitative association of component-DHSs with GWAS traits 
reticulocyte count, pulse rate, and FEV1/FVC ratio. Canonical genome-wide 
significance threshold indicated (5 × 10⁻⁸). a, Enrichment ratios for increasingly 
stringent subsets of variants, per DHS component. b, Nominal enrichment 
−log10(P value) of a one-sided binomial test for each DHS component. c, Nominal 
enrichment −log10(P value) of a one-sided binomial test for the strongest DHS 
component only, along with its strongest associated biosamples. d, GWAS 
traits associated with component-annotated index DHSs. e, Greyscale values, 


heritability enrichment levels for statistically significant (FDR 1%) traits based 
on the full delineated width of index DHSs (left) and restricted to index DHS 
‘core’ regions (right). Row labelled as per d. f, Ratio of heritability enrichment 
values for ‘core’ versus ‘full size’ DHSs. g, DHS confidence scores (mean signal) 
stratified according to gene landscape DHS types. Boxes denote medians and 
interquartile ranges (IQRs, 25-75%), whiskers represent 1.5 × IQRs, n = 261 GWAS 
traits. h, Heritability enrichments stratified according to gene landscape DHS 
types. Greyscale indicates heritability enrichment levels for statistically 
significant associations (1% FDR). 
Extended Data Fig. 10 | Extendability of the DHS annotation framework. 

a, Two-dimensional UMAP projection of 733 biosamples by way of their index 
DHS utilization, coloured by their strongest representative NMF component. 
Stars indicate the embeddings of 38 previously unseen immune-related 
DNase-seq peak call datasets. b, Area under precision recall curve (AUPRC) 
values for predicting per-biosample DHSs from DNase-seq signal alone, shown 
patterns of DHSs across biosamples are available in various formats via Zenodo (https://doi.org/10.5281/zenodo.3838751). There are no restrictions on data 
availability and (re)use. We additionally provide a specialized data browser (https://index.altius.org/) and a trackhub for the UCSC Genome Browser 
(https://genome.ucsc.edu/cgi-bin/hgTracks?db=hg38&hubUrl=https://resources.altius.org/~meuleman/DHS_Index_tracks/hub.txt). BED files documenting the coordinates 
and annotations of DHSs with evidence of being bound by specific transcription factors are available via Zenodo (https://doi.org/10.5281/zenodo.3838751), and 
top-scoring elements per TF can be explored in a browser (https://index.altius.org/?application=viewer&roiSet=TFassoc_Meuleman). Various databases were 
consulted to obtain gene expression data (ARCHS4, https://amp.pharm.mssm.edu/archs4/download.html, accessed June 26, 2018), gene annotations (GENCODE 
v28 https://www.gencodegenes.org/human/release_28.html, psiCube http://pseudogene.org/psicube/data/gencode.v10.pgene.parents.txt) and pathway 
annotations (MSigDB v6.1 https://www.gsea-msigdb.org/gsea/msigdb/collections.jsp, KEGG v89.1 https://www.genome.jp/kegg/). 
The source of all cell line material is described (when available) in Supplementary Table 1 and the online Google Spreadsheet. 
Cell lines were procured from appropriate commercial sources. hESC lines used were from the NIH approved list and provided by 
laboratories with expertise in growing, characterizing and differentiating these cell types (see ENCODE website for details 
and protocols). 

Authentication was provided by the commercial vendors from which the cell lines were procured and in accordance with 
ENCODE policies. Beyond this, no cell lines were further authenticated. 

Mycoplasma contamination 
Cell lines were not routinely checked for mycoplasma contamination. 

Commonly misidentified lines (See ICLAC register) 
One included cell line (SK-N-MC) appears on the ICLAC list, and we properly document this biosample as Ewing’s sarcoma as 
opposed to a neuroepithelioma biosample. 


Human research participants 


Policy information about studies involving human research participants 


Population characteristics 
Population characteristics and other biosample metadata are described (when available) in Supplementary Table 1 and the 
online Google Spreadsheet. Human primary tissues were collected under IRB approved protocols using Open-access 
consents, as per NHGRI/ENCODE policies. Care was taken to ensure sex parity for tissues collected, to the extent possible. All 
primary tissues processed by our center were received as anonymized samples with minimal metadata information about 
tissue type, age, sex and ethnicity of the donor. As such, our institutional IRB reviewed our study using anonymized primary 
tissue and cell samples, and classified it as non-human subject studies. 

Recruitment 
Donors were recruited by collection centers (like Fred Hutch and dbGaP), using IRB approved Open-access consents. We 
received anonymized 

Ethics oversight 
University of Washington IRB 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 




Article 


Rescue of oxytocin response and social 
behaviour in a mouse model of autism 


https://doi.org/10.1038/s41586-020-2563-7 


Hanna Hörnberg, Enrique Pérez-Garci, Dietmar Schreiner, Laetitia Hatstatt-Burklé, 
Fulvio Magara, Stephane Baudouin, Alex Matter, Kassoum Nacro, Eline Pecho-Vrieseling 
& Peter Scheiffele 


Received: 20 May 2019 


Accepted: 25 May 2020 


Published online: 5 August 2020 


A fundamental challenge in developing treatments for autism spectrum disorders is 
the heterogeneity of the condition. More than one hundred genetic mutations confer 


high risk for autism, with each individual mutation accounting for only a small 
fraction of cases’ *. Subsets of risk genes can be grouped into functionally related 
pathways, most prominently those involving synaptic proteins, translational 
regulation, and chromatin modifications. To attempt to minimize this genetic 
complexity, recent therapeutic strategies have focused on the neuropeptides 
oxytocin and vasopressin‘ ®, which regulate aspects of social behaviour in mammals’. 
However, it is unclear whether genetic risk factors predispose individuals to autism as 
a result of modifications to oxytocinergic signalling. Here we report that an 
autism-associated mutation in the synaptic adhesion molecule Nlgn3 results in 
impaired oxytocin signalling in dopaminergic neurons and in altered behavioural 
responses to social novelty tests in mice. Notably, loss of Nlgn3 is accompanied by a 
disruption of translation homeostasis in the ventral tegmental area. Treatment of 
Nlgn3-knockout mice with a new, highly specific, brain-penetrant inhibitor of MAP 
kinase-interacting kinases resets the translation of mRNA and restores oxytocin 
signalling and social novelty responses. Thus, this work identifies a convergence 
between the genetic autism risk factor Nlgn3, regulation of translation, and 
oxytocinergic signalling. Focusing on such common core plasticity elements might 
provide a pragmatic approach to overcoming the heterogeneity of autism. Ultimately, 
this would enable mechanism-based stratification of patient populations to increase 
the success of therapeutic interventions. 


Social recognition and communication are crucial elements in the 
establishment and maintenance of social relationships. Oxytocin and 
vasopressin are two evolutionarily conserved neuropeptides with 
important functions in the control of social behaviours, in particular 
pair-bonding and social recognition. In humans, genetic variation 
of the oxytocin receptor (OXTR) gene is linked to individual differences 
in social behaviour. Consequently, signalling modulators and 
biomarkers for the oxytocin or vasopressin system are being explored 
for conditions with altered social interactions such as autism spectrum 
disorders (ASDs). In mice, mutation of the genes encoding oxytocin 
or its receptor results in a loss of social recognition and social reward 
signalling. Mutation of Cntnap2, a gene linked to ASD in humans, 
resulted in reduced levels of oxytocin in mice, and the addition of oxytocin 
improved social behaviour in this model. However, the vast 
majority of genetic risk factors for autism have no known links to 
oxytocinergic signalling. 

Here we explored oxytocin responses in mice that recapitulate a loss 
of function in the autism risk gene Nlgn3. Nlgn3 encodes a synaptic 
adhesion molecule, and Nlgn3 mutant mice exhibit a range of behavioural 
alterations, including motor stereotypies, alterations in social 
novelty preference (but see refs. 7**°), social reward, and responses 
to social novelty tests. Despite these alterations in social behaviours, 
adult Nlgn3-knockout (Nlgn3KO) mice exhibit normal responses to inanimate 
objects. In a five-trial social habituation/recognition task, we 
observed that the social novelty response phenotype is already 
established in juvenile Nlgn3KO mice (Fig. 1a–c, Extended Data Fig. 1a, b, Methods). 
The recognition of and response to unfamiliar conspecifics depend 
on several neural circuit elements, including dopaminergic cells in the 
ventral tegmental area (VTA DA neurons). Re-expression of Nlgn3 
selectively in dopaminergic cells restored social novelty responses in 
juvenile Nlgn3KO mice (Fig. 1d–f, Extended Data Fig. 1c, d). By contrast, 
selective inactivation of Nlgn3 in VTA DA neurons was sufficient to affect 
social novelty responses (Fig. 1g–i, Extended Data Fig. 1e–h). Social 
recognition in this assay depends on the function of the oxytocin receptor, 
as treatment of wild-type mice with the oxytocin receptor antagonist 
L-368,899 impaired recognition (Fig. 1j–l, Extended Data Fig. 1i, j). 

Fig. 1 | Oxytocin response is altered in VTA DA neurons lacking Nlgn3. 

a, Five-trial social habituation/recognition task. s1 and s2 denote first and 
second social stimulus, respectively; 2′ and 5′ denote 2 and 5 min. b, c, Mean 
social interaction time (b) and social recognition index (c) in wild-type (WT) 
and Nlgn3KO mice. Numbers on graphs represent mice. d, Nlgn3KO mutation 
containing a loxP-flanked transcriptional stop cassette. In DAT-cre mice, Nlgn3 
is selectively re-expressed in dopaminergic neurons. e, f, Mean social 
interaction time (e) and social recognition index (f) in DAT-cre and 
DAT-cre::Nlgn3KO mice. g, VTA adeno-associated virus (AAV) injection in DAT-cre 
mice for microRNA-mediated knockdown of Nlgn3 in VTA DA neurons 
(VTA::DA-NL3). h, i, Mean social interaction time (h) and social recognition 
index (i) for VTA::DA-NL3 or control (VTA::DA-miR) mice. j, Treatment with the 
oxytocin receptor antagonist (OXTR-A) (L-368,899 10 mg kg−1 intraperitoneally 
in saline). k, l, Mean social interaction time (k) and social recognition index (l) 
for mice treated with OXTR-A or vehicle. m, Retrobeads were injected in the 


The function of VTA DA neurons in social novelty responses and 
reinforcement is in part dependent on an oxytocin-induced increase 
in neuronal firing. Specifically, oxytocin released from axons that arise 
from hypothalamic nuclei increases the firing of VTA DA neurons projecting 
to the nucleus accumbens (NAc). Thus, we examined whether 
loss of Nlgn3 might affect the response of this population of neurons to 
oxytocin. We marked VTA DA neurons projecting to the NAc medial shell 
and performed electrophysiological recordings from back-labelled 
neurons in acute slices of the VTA (Fig. 1m, n). Consistent with previous 
reports, back-labelled neurons showed low hyperpolarization-activated 
cation currents (Ih), and there was no significant difference between 
genotypes in Ih and other basic biophysical properties (Extended Data 
Fig. 2). In cell-attached recordings, the frequency of baseline firing of 
VTA DA neurons from Nlgn3KO mice was slightly reduced compared with 
wild-type mice (Fig. 1p). Notably, bath application of 1 µM oxytocin 
significantly increased firing frequency in cells from wild-type mice 
but had no effect in slices from Nlgn3KO mice (Fig. 1o, q). These findings 
uncover a requirement for the autism risk factor Nlgn3 for oxytocin 
responses in the VTA. 

A loss of oxytocinergic neurons has been reported in knockout mice 
for the autism risk factors Cntnap2 and Shank3b. In Nlgn3KO mice, we 
did not detect any alteration in the density of oxytocinergic neurons in 

NAc medial shell, and tissues were prepared for slice physiology. n, Retrobead 
injection site in the NAc (top) and tyrosine hydroxylase (TH) immunoreactivity 
and retrobead labelling in biocytin-filled VTA neurons (bottom) after 
recording. o, Example traces of spontaneous firing at baseline (top) and after 
bath application of 1 µM OXT (bottom) in VTA DA neurons from wild-type (left) 
and Nlgn3KO mice (right). p, Frequency at baseline in TH-positive wild-type and 
Nlgn3KO VTA DA neurons. Wild type n = 22 cells from 12 mice; Nlgn3KO n = 14 
neurons from 8 mice. q, Average firing frequency (Hz) after bath application 
of OXT in retrobead-labelled TH-positive cells in the VTA of wild-type and 
Nlgn3KO mice. The P values shown are from within-group comparison of 
baseline versus OXT after 3 min. Wild type n = 8 cells from 7 mice; Nlgn3KO n = 10 
neurons from 7 mice. Error bars denote s.e.m. P values determined by 
repeated-measures two-way ANOVA followed by Bonferroni's post hoc test 
(b, e, h, k, q), unpaired two-sided t-test (c, f, i, p) or two-sided Mann–Whitney 
test (l). See Supplementary Information for additional statistics. 


the paraventricular nucleus (one of the major oxytocinergic nuclei) or in 
the density of oxytocinergic fibres in the VTA (Fig. 2a, b, Extended Data 
Fig. 3a–c). Fluorescent in situ hybridization (FISH) analysis revealed 
a slight increase in Oxtr mRNA in VTA DA neurons from Nlgn3KO mice 
compared with wild-type animals (Fig. 2c, d, Extended Data Fig. 3d–f). 
However, targeted proteomics (parallel reaction monitoring) on microdissected 
VTA tissue did not detect significant alterations in total oxytocin 
receptor protein, with no change in vasopressin 1A receptor mRNA 
(Avpr1a) or protein (Extended Data Fig. 3g–j, Supplementary Table 1). 
Thus, we performed shotgun proteomics for an unbiased identification 
of molecular alterations in the VTA of Nlgn3KO mice (Extended 
Data Fig. 4a–c, Supplementary Table 2). Gene Ontology (GO) and 
network-based functional classification analysis for proteins altered 
in Nlgn3KO VTA identified protein transport, cell adhesion, and mRNA 
translation as main categories (Extended Data Fig. 4b, c). Dysregulation 
of membrane trafficking and G-protein-coupled receptor (GPCR) signalling 
components is consistent with the previously discovered roles for 
Nlgn3 in synapse organization and GPCR signalling. However, the 
alterations in regulators of translation were surprising. Alterations in 
mRNA translation have been linked to deficits in neuronal plasticity. We 
previously observed that behaviourally induced plasticity is altered in 
VTA DA neurons of Nlgn3KO mice. Thus, we compared translation in VTA 

Fig. 2 | Disruption of translational regulation in the VTA of Nlgn3KO mice. 

a, Representative image of OXT (red) and DAPI (blue) immunofluorescence in 
the PVN of wild-type and Nlgn3KO mice. b, Mean OXT-positive cells per section 
in the PVN in wild-type and Nlgn3KO mice. n = 3 mice per genotype; numbers on 
bars indicate sections analysed. c, Representative FISH images in the VTA from 
wild-type (top) and Nlgn3KO (bottom) mice using probes for Th (red), Nlgn3 
(green) and Oxtr (magenta). d, Quantification of mean fluorescence intensity 
per Th+ cell in the VTA from wild-type and Nlgn3KO mice for Nlgn3 (left) and Oxtr 
(right). n: wild type = 280 cells from 4 mice; Nlgn3KO = 265 cells from 3 mice. AU, 
arbitrary units. e, Scheme for AHA incorporation on slice preparations from 

DA neurons of naive and behaviourally exposed wild-type and Nlgn3KO 
mice. Incorporation of the methionine analogue azidohomoalanine 
(AHA) in VTA DA neurons (marked by tyrosine hydroxylase) of acute 
mouse brain slices from naive Nlgn3KO mice was reduced compared with 
wild-type mice (Fig. 2e–g, Extended Data Fig. 4d). Notably, the incorporation 
of AHA was increased in VTA DA neurons from Nlgn3KO mice 
exposed to handling (Fig. 2e–g). This suggests a signalling-dependent 
disruption of translation homeostasis in the mutant mice. 

Disruption of translation homeostasis is thought to broadly modify 
neuronal proteins, resulting in impaired plasticity and neurodevelopmental 
conditions. Thus, we sought to normalize translation in 
Nlgn3KO mice and test whether this would restore oxytocin responses 
in VTA DA neurons. We focused on MAP kinase-interacting kinases 
(MNKs), which are crucial regulators of signalling-dependent modification 
of mRNA translation (Fig. 3a). Inhibition of MNK was reported 
to modify ribosomal protein levels and to ameliorate behavioural and 
plasticity alterations in Fmr1 knockout (Fmr1KO) mice. We tested 
efficacy and brain penetrance of a series of highly specific MNK 
inhibitors originally developed for oncology applications. One of 
these compounds, ETC-168, resulted in a dose-dependent reduction 
in phosphorylation of the MNK target eIF4E in cultured cortical 
mouse neurons without affecting protein levels of eIF4E or eIF4G or 
the activity of ERK1/2 (Fig. 3b, Extended Data Fig. 5a–e). In wild-type 
mice, orally administered ETC-168 had significant brain penetration 
(brain-to-plasma ratio 0.12) with a half-life of 4.3 h (Fig. 3d), and resulted 
in a reduction in eIF4E phosphorylation in the VTA and other brain areas 
(Fig. 3c, Extended Data Fig. 5f, l). Of note, phospho- or protein levels of 
eIF4E or MNK1 are unaltered in the VTA of Nlgn3KO mice (Extended Data 
Fig. 5g–k). As a proof of concept, we probed the effectiveness of ETC-168 
treatment in modifying a behavioural phenotype in Fmr1KO mice. 
In a place-independent discrimination task, Fmr1KO and wild-type mice 
learned a cue–reward contingency rule equally well, but Fmr1KO mice 
were impaired on contingency reversal (Extended Data Fig. 6a, b). Treatment 
with ETC-168 during the reversal phase of the task significantly 
improved their performance (Extended Data Fig. 6c, d). Thus, ETC-168 
is a highly selective, brain-penetrant inhibitor of MNK1/2 activity, which 
modifies cognitive behaviour in mice. 

We then asked whether ETC-168 treatment would modify the translational 
machinery in the VTA of Nlgn3KO mice. Using tandem mass 

untreated and vehicle-treated mice. f, Representative images of acute slice 
measurements of protein synthesis in VTA visualized by AHA incorporation 
(green) with marking of TH-positive cells (red). g, Quantitative assessment of 
AHA incorporation in naive mice versus mice treated by oral gavage with 
vehicle. Wild-type untreated n = 3 mice; Nlgn3KO untreated, Nlgn3KO vehicle and 
wild-type vehicle n = 4 mice. Numbers on graph refer to the number of images 
analysed (8–20 VTA DA cells per image, approximately 10 images per mouse). 
All error bars are s.e.m. P values determined by two-sided Mann–Whitney test 
(b, d) or Kruskal–Wallis test followed by Dunn’s multiple comparison test (g). 
See Supplementary Information for additional statistics. 


tag (TMT)-based isobaric labelling, we verified the deregulation of 
translational machinery in VTA tissue from Nlgn3KO versus wild-type 
mice treated with vehicle (Extended Data Fig. 7a–d). Gene set enrichment 
analysis uncovered an increase in core proteins of cytoplasmic 
but not mitochondrial ribosomes (Fig. 3e, Extended Data Fig. 7e, g). 
Inhibition of MNK with ETC-168 abolished the increase in core ribosomal 
proteins in Nlgn3KO VTA (Fig. 3f, Extended Data Fig. 7f, h). In acute 
slices from Nlgn3KO mice treated with ETC-168, incorporation of AHA 
was significantly reduced as compared with Nlgn3KO mice treated with 
vehicle, resulting in similar AHA incorporation levels to slices from 
vehicle-treated wild-type mice (Fig. 3g–i). 

We then tested whether ETC-168 treatment restored oxytocin 
responses and social novelty responses in Nlgn3KO mice (Fig. 4a). Notably, 
short-term oral treatment (two applications of 5 mg kg−1 ETC-168 
over 26 h) of Nlgn3KO mice recovered the oxytocin-induced increase 
in firing frequency seen in wild-type mice (Fig. 4b, c). This treatment 
also fully restored social novelty responses, with no detectable effect 
on wild-type mice (Fig. 4a, d–f, Extended Data Fig. 8). This pharmacologically 
recovered social recognition behaviour was dependent on 
oxytocin receptor function (Extended Data Fig. 9). Notably, ETC-168 
treatment was well tolerated and remained effective in a long-term 
treatment regime (Extended Data Figs. 7a–c, 10). Thus, modification 
of translation homeostasis in Nlgn3KO mice by MNK inhibition restores 
oxytocin responses and social novelty responses. 

This work uncovers an unexpected convergence between the genetic 
autism risk factor Nlgn3, translational regulation, oxytocinergic 
signalling, and social novelty responses. Although loss of Nlgn3 impairs 
oxytocin responses in VTA DA neurons, the behavioural phenotype does 
not fully phenocopy genetic loss of oxytocin. Oxytocin knockout mice 
exhibit impaired habituation in the social recognition task, whereas 
Nlgn3KO mice habituate normally but exhibit a selective deficit in the 
response to a novel conspecific. This is probably due to differential 
roles of Nlgn3 and oxytocin across several neural circuits and over 
development. Moreover, Nlgn3 loss-of-function also affects signalling 
through additional GPCRs. 

We propose that pharmacological inhibition of MNKs may provide 
a new therapeutic strategy for neurodevelopmental conditions 
with altered translation homeostasis. Notably, MNK loss-of-function 
appears to be overall well tolerated. MNK1/2 double-knockout mice 

Fig. 3 | The novel MNK1/2 inhibitor ETC-168 rescues translation in Nlgn3KO 
VTA. a, ETC-168 targets MNK1/2. Note that eIF4E phosphorylation decreases 
affinity of eIF4E for the mRNA 5′ cap structure. b, Representative western blot 
(top) and quantification (bottom) of eIF4E phosphorylation in days in vitro 14 
cortical neurons treated with DMSO or ETC-168 for 3 h. n = 8 replicates, 3 
independent experiments. c, Representative western blot (top) and 
quantification (bottom) of eIF4E phosphorylation in VTA lysate from 
wild-type mice treated with vehicle or 5 mg kg−1 ETC-168. Numbers on graphs 
represent mice. d, Pharmacokinetic analysis of ETC-168 concentration in male 
mice (n = 27) after a single oral dose of 10 mg kg−1. Plasma levels are in red, and 
brain levels are in blue. The half-life (t1/2) and the brain-to-plasma exposure 
(AUC) are shown. e, Tandem mass tag (TMT) proteomic comparison of VTA 
from vehicle-treated mice (n = 4 mice per genotype and treatment). Relative 
frequency of log2-transformed fold change in all detected proteins, cytosolic, 
or mitochondrial ribosomal protein abundance is plotted (Nlgn3KO versus wild 
type). f, Comparison as in e for log2-transformed fold change in protein 
abundance in ETC-168-treated Nlgn3KO versus vehicle-treated wild-type mice. 
g, FUNCAT assay in acute slices from vehicle-treated or ETC-168-treated mice 
(5 mg kg−1 or vehicle by oral gavage). h, Representative examples of AHA 
incorporation (green) in TH-positive cells (red). i, Quantitative assessment 
of AHA incorporation in vehicle (as in Fig. 2g for comparison to untreated 
mice) versus ETC-168-treated wild-type and Nlgn3KO mice. n = 4 mice per 
genotype and treatment. Numbers on graphs refer to number of images 
analysed. Error bars denote s.e.m. P values determined by one-way ANOVA 
followed by Bonferroni's post hoc test (b), two-sided unpaired t-test (c), 
Kolmogorov–Smirnov test (e, f), or Kruskal–Wallis test followed by Dunn’s 
multiple comparison test (i). See Supplementary Information for additional 
statistics. 

Fig. 4 | MNK inhibition restores social novelty responses in Nlgn3KO mice. 
a, Scheme for drug treatment and analysis. b, Firing frequency at baseline in 
VTA DA neurons from Nlgn3KO mice untreated (as previously shown in Fig. 1p), 
treated with vehicle, or treated with 5 mg kg−1 ETC-168. Untreated n = 14 neurons 
from 8 mice; vehicle n = 8 neurons from 4 mice; ETC-168 n = 14 neurons from 8 
mice. The wild-type mouse data from Fig. 1p are presented for comparison 
(n = 22). c, OXT-induced frequency changes over time in VTA DA neurons from 
Nlgn3KO mice treated with vehicle or 5 mg kg−1 ETC-168. Wild-type mice from 
Fig. 1q are presented for comparison. P values shown are for baseline versus 
OXT at 3 min. Vehicle-treated n = 8 cells from 4 mice; ETC-168-treated n = 11 
neurons from 6 mice. d, e, Mean social interaction time in wild-type (d) and 
Nlgn3KO (e) mice treated with vehicle or 5 mg kg−1 ETC-168. f, Social recognition 
index for wild-type and Nlgn3KO mice treated with vehicle or 5 mg kg−1 ETC-168. 
Numbers in brackets indicate mice. All error bars are s.e.m. P values 
determined by one-way ANOVA followed by Bonferroni’s post hoc test (b), 
repeated measures two-way ANOVA followed by Bonferroni’s post hoc test (c), 
repeated measures two-way ANOVA between all genotype and treatment 
groups followed by Bonferroni’s post hoc test (d, e), or repeated measures 
two-way ANOVA followed by Bonferroni's post hoc test for treatment and 
genotype (f). See Supplementary Information for additional statistics. 


are viable and several MNK inhibitors are entering clinical trials for 
cancer therapy. Previously available MNK inhibitors were greatly limited 
by poor specificity and brain penetrance. Our work not only highlights 
a new class of highly specific, brain-penetrant MNK inhibitors but also 
expands their application from fragile X syndrome to a non-syndromic 
model of ASD. The common disruption in translational machinery and 
phenotypic rescue in two very different genetic models indicate that 
genetic heterogeneity of ASD might be reduced to a smaller number of 
cellular core processes. This raises the possibility that pharmacological 
interventions targeting such core processes may benefit broader 
subsets of patient populations. 

Methods 


Mice 

Male wild-type, Nlgn3KO (ref. ”) and Fmr1KO (ref. *°) mice were used for 
this study. Note that this strain of Nlgn3KO mice contains a transcriptional 
stop cassette, preventing genetic compensation events that 
might be triggered in mutants expressing truncated mRNAs. For 
dopamine neuron-specific manipulations, DAT-cre BAC transgenic 
mice were used. Mice were kept on a C57BL/6J background. All animals 
were group housed (weaning at postnatal day (P) 21–P23) under 
a 12 h light/dark cycle (06:00–18:00) with food and water ad libitum. 
All physiology and behaviour experiments were performed during the 
light cycle. Embryos for cortical cultures were obtained from NMRI mice 
(Janvier). All the procedures performed at University of Lausanne and 
Biozentrum complied with the Swiss National Institutional Guidelines 
on Animal Experimentation and were approved by the respective Swiss 
Cantonal Veterinary Office Committees for Animal Experimentation. 


Pharmacokinetics of ETC-168 

A group of 27 male C57BL/6 mice were administered ETC-168 solution 
formulation in 7.5% NMP, 5% Solutol HS, 10% PG, 30% PEG-400, 
47.5% normal saline at a dose of 10 mg kg−1. Blood samples (approximately 
60 µl) were collected under light isoflurane anaesthesia from 
the retro-orbital plexus at 0.25, 0.5, 1, 2, 4, 8, 12 and 24 h after drug 
administration. Plasma samples were separated by centrifugation of whole 
blood and stored below −70 °C until analysis. Immediately after collection 
of blood, brain samples were collected from each mouse at PD, 
0.25, 0.5, 1, 2, 4, 8, 12 and 24 h. Brain samples were homogenized using 
ice-cold PBS (pH 7.4) and homogenates were stored below −70 °C until 
analysis. Total homogenate volume was three times the tissue weight. 
All samples were processed for analysis by protein precipitation using 
acetonitrile and analysed with a fit-for-purpose LC-MS/MS method 
(lower limit of quantification (LLOQ): 2.00 ng ml−1 in plasma and 1.00 
ng ml−1 in brain). Pharmacokinetic parameters were calculated using 
the non-compartmental analysis tool of Phoenix WinNonlin (v.6.3). 
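
As an illustration of the non-compartmental quantities reported for ETC-168 (exposure as area under the concentration-time curve, terminal half-life, and brain-to-plasma ratio), the following Python sketch applies the linear trapezoidal rule and a log-linear fit to the terminal sampling points. It is not the Phoenix WinNonlin procedure used in the study. The function names, the use of the last three points for the terminal phase, and all concentration values are assumptions made purely for illustration.

```python
import math

def auc_trapezoid(times_h, conc):
    """Area under the concentration-time curve (linear trapezoidal rule)."""
    return sum(
        (t1 - t0) * (c0 + c1) / 2.0
        for t0, t1, c0, c1 in zip(times_h, times_h[1:], conc, conc[1:])
    )

def terminal_half_life(times_h, conc, n_last=3):
    """Half-life from a least-squares line through ln(conc) of the last n_last points."""
    t = times_h[-n_last:]
    y = [math.log(c) for c in conc[-n_last:]]
    t_mean = sum(t) / len(t)
    y_mean = sum(y) / len(y)
    slope = sum((ti - t_mean) * (yi - y_mean) for ti, yi in zip(t, y)) / sum(
        (ti - t_mean) ** 2 for ti in t
    )
    return math.log(2) / -slope

# Sampling times follow the protocol (h). Concentrations are HYPOTHETICAL
# mono-exponential values with a 4 h half-life, not data from the study.
times_h = [0.25, 0.5, 1, 2, 4, 8, 12, 24]
plasma = [1000.0 * math.exp(-math.log(2) / 4.0 * t) for t in times_h]
brain = [0.1 * c for c in plasma]  # assumed 10% of plasma, for illustration only

half_life_h = terminal_half_life(times_h, plasma)
brain_to_plasma = auc_trapezoid(times_h, brain) / auc_trapezoid(times_h, plasma)
```

With real data, the number of terminal points would be chosen by inspecting the log-linear phase rather than fixed at three.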


Pharmacological treatment 

For in vitro experiments, ETC-168 was dissolved in DMSO. For in vivo 
treatment, ETC-168 was dissolved in 0.5% methylcellulose (Sigma, 
M7140) and 0.1% Tween-80 (Sigma, P5188) in MilliQ water to 1.25 mg 
ml−1, and sonicated for 30 min. For acute treatment, animals were gavaged 
24 h and 2 h before behavioural assessment or tissue collection. In the 
chronic treatment regime, animals were treated every 24 h, 2 h before 
start of the evaluation. The OXTR antagonist L-368,899 (Tocris, 2641) 
was dissolved in saline and 10 mg kg−1 was applied via i.p. injection 2 h 
before the start of behavioural assessments, or 15 min before the final 
dose of ETC-168. 
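
The stock concentration given above (1.25 mg ml−1) and the 5 mg kg−1 oral dose reported in the figure legends together imply a gavage volume per animal. The short Python sketch below works through that arithmetic. The dosing volume is not stated in the Methods, so the function name and the example body weight are assumptions used only to show the calculation.

```python
def gavage_volume_ml(body_weight_g, dose_mg_per_kg=5.0, stock_mg_per_ml=1.25):
    """Volume of suspension (ml) that delivers the dose to one animal."""
    dose_mg = dose_mg_per_kg * body_weight_g / 1000.0  # convert g to kg
    return dose_mg / stock_mg_per_ml

# Hypothetical 20 g mouse: 0.1 mg of compound, i.e. 0.08 ml (4 ml per kg).
volume_ml = gavage_volume_ml(20.0)
```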


Stereotaxic injection 

Injections of diluted (1:4) Red Retrobeads were done at P19–P23 and 
performed under isoflurane anaesthesia (Baxter AG). The mice were 
placed in a stereotaxic frame (Kopf Instrument) and a single craniotomy 
was made over the NAc medial shell at the following stereotaxic coordinates: 
ML +0.55 mm, AP 1.5 mm, DV −4.05 mm from bregma. Injections 
were made with a 30-G Hamilton needle (Hamilton, 7762-03) for a total 
volume of 300 nl. Injection sites were confirmed post hoc by immunostaining. 
For electrophysiological experiments, the mice were left 
5–8 days after injection before the start of experiment. 

For Nlgn3 conditional knockdown experiments, injections of purified 
AAV2-DIO-miRNL3-GFP and AAV2-DIO-miR-GFP were performed as 
previously described. In brief, DAT-cre transgenic mice were injected 
at P5–P6 and a single craniotomy was made over the VTA at the following 
stereotaxic coordinates: ML +0.15 mm, AP +0.2 mm, DV −4.2 mm from 
lambda for P5–P6. Injections were made with a 33-G Hamilton needle 
(Hamilton, 65460-02) for a total volume of 200 nl. Behavioural testing 
was performed at P28 and injection sites were confirmed post hoc 
by immunostaining as described previously. In brief, VTA::DA-NL3 
mice were included if a minimum of 20% of TH-positive cells in the 
VTA were GFP-positive. Mice were excluded from the analysis if their 
body weight was less than 75% of the mean body weight at the start of 
behavioural trials or if post hoc analysis revealed inefficient or 
off-target viral infection. 
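
The inclusion and exclusion rules for the knockdown cohort can be expressed as a single check. The Python sketch below is one possible reading of those rules, assuming that cell counts and body weights are already available per animal. The function and parameter names are hypothetical, and off-target infection is treated as a flag set by the experimenter after histology.

```python
def include_knockdown_mouse(th_cells, gfp_th_cells, body_weight_g,
                            mean_body_weight_g, off_target=False):
    """Return True if a mouse passes the stated inclusion criteria."""
    if th_cells <= 0:
        raise ValueError("th_cells must be positive")
    # At least 20% of TH-positive VTA cells must be GFP-positive.
    efficient = gfp_th_cells / th_cells >= 0.20
    # Body weight must not be below 75% of the cohort mean at the start of trials.
    healthy_weight = body_weight_g >= 0.75 * mean_body_weight_g
    return efficient and healthy_weight and not off_target
```

Keeping the thresholds in one function makes it easier to apply the same criteria to every animal before the data are unblinded.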


Habituation and novelty recognition task 

Social recognition is considered to be commonly affected in individuals 
on the autism spectrum. Autistic individuals perform poorly on face 
identity recognition tasks, especially when a medium to high time delay 
(seconds to several minutes) between face presentations is applied. 
To model social recognition in juvenile male mice, we adopted previous 
protocols that had been developed for adult rodents. In this test, 
recognition between juvenile male mice (P26–P32) was tested with 
5 min inter-trial intervals that mimic the timescales of recognition tasks 
from patient studies. An experimental cage similar to the animal’s home 
cage was used with grid, food, and water removed. The experimental 
mice were acclimated in the cage for 30 min before the start of the test. 
At the start of the first trial, a novel same-sex mouse (stimulus mouse: 
C57BL/6J juvenile male mice, P21–P28) or an object (Lego block) was 
introduced into the cage for 2 min and mice were left to freely interact. 
This was repeated for 4 consecutive trials with 5 min inter-trial 
intervals to allow habituation to the stimulus mouse or object. On the 
fifth trial, a novel mouse (littermate to the stimulus mouse) or a novel 
object (dice) was introduced. For the social stimulus, interaction was 
scored only when the experimental mouse initiated the action and 
the nose of the animal was oriented towards the social stimulus mouse. 
For the object stimulus, interaction was scored when the nose of 
the mouse was oriented towards the object at a distance of 1 cm or less. 
The interaction time was used to calculate the recognition index as: (interaction trial 
5) − (interaction trial 4). Social recognition in rodents has been reported 
to depend on oxytocin signalling. To pharmacologically validate the 
recognition task, we treated mice with the oxytocin receptor antagonist 
(L-368,899 10 mg kg−1 injected intraperitoneally 2 h before testing). This 
treatment significantly suppressed recognition of novel conspecifics 
in this task (Fig. 1k, l). 
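
The recognition index defined above is a simple difference between the last two trials. A minimal Python sketch, assuming the five interaction times (in seconds) are stored in trial order, is shown here. The function name and the example values are hypothetical and do not come from the study.

```python
def recognition_index(interaction_s):
    """Interaction time on trial 5 minus interaction time on trial 4 (s)."""
    if len(interaction_s) != 5:
        raise ValueError("expected interaction times for exactly five trials")
    return interaction_s[4] - interaction_s[3]

# Hypothetical animals: one that re-engages with the novel mouse on trial 5,
# and one whose interaction stays low.
responder = [40.0, 31.0, 24.0, 18.0, 39.0]
non_responder = [40.0, 31.0, 24.0, 18.0, 17.0]
```

A positive index indicates increased interaction with the novel stimulus after habituation; an index near zero or below indicates no novelty response.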


Place-independent cue discrimination and reversal task 

Adult male mice were used for this test. The test box (25 × 35 cm) was 
divided into a waiting and a reward zone by a gated plexiglass wall. Reward 
(condensed milk) was associated with one of the two lining patterns 
of a double tray (white tape versus brown sandpaper). The tray was 
turned in a pseudorandom fashion between trials in order to present 
the rewarded pattern on the right and the left 8 times each within a daily 
session of 16 trials. Sliding lids were used to prevent nosepokes in the 
correct target after the mouse made a wrong choice, and to signal the end of 
a trial and return to the waiting zone after the bait had been consumed. 
Before the start of the trial, mice were food deprived overnight and 
the reward was presented in the home cage to habituate them to the reward. 
For the duration of the test, mice were food restricted overnight and 
received food ad libitum after completion of the discrimination task. 
After the first night of food deprivation, mice were brought to the testing 
arena, where they found condensed milk droplets (15 µl) in falcon 
lids similar to those used for the test. The first day of habituation took 
place in groups of cage mates, the second day in individual sessions. Mice 
were shaped to shuttle to the waiting compartment of the arena after 
having consumed the reward. On day 1, mice were trained to find the 
reward only in one of two adjacent falcon lids that had been lined and 
mounted on little stages with a different pattern (brown sandpaper 
versus white tape). Each mouse underwent 16 daily trials, with a cut-off 
of 20 min. Mice not completing 16 trials by the third testing day were 
excluded from further testing. Mice not attaining the learning criterion 
for the first contingency (8 consecutive correct responses over two 
days) after 6 days of training did not go to the reversal learning step 
(or receive any treatment). Mice were trained for 6 consecutive days 
per week. Feeding was restricted to 1 g per mouse overnight. On day 7 
(day 1 of reversal learning training), mice received ETC-168 5 mg kg−1 
or vehicle by gavage 120 min before starting the tests for the duration 
of the reversal task. For the contingency reversal learning, mice were 
trained to nosepoke in the previously non-baited pattern in order to find 
the reward. The training schedule was the same as before: 16 trials per day, 
until attainment of the learning criterion (second day when 8 consecutive 
correct responses have been performed). 
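
Two elements of this protocol lend themselves to a short sketch: the balanced pseudorandom placement of the rewarded pattern (8 left and 8 right presentations in 16 trials) and the check for a run of 8 consecutive correct responses. The Python code below is one way to express them. The function names are hypothetical, and the run check looks only within a single session, because the text does not specify how runs are counted across the two days.

```python
import random

def balanced_side_schedule(n_trials=16, seed=None):
    """Shuffled list with equal numbers of 'left' and 'right' placements."""
    if n_trials % 2:
        raise ValueError("n_trials must be even")
    rng = random.Random(seed)
    sides = ["left", "right"] * (n_trials // 2)
    rng.shuffle(sides)
    return sides

def longest_correct_run(outcomes):
    """Length of the longest streak of True values in a list of trial outcomes."""
    best = current = 0
    for correct in outcomes:
        current = current + 1 if correct else 0
        best = max(best, current)
    return best

def meets_run_criterion(outcomes, run_length=8):
    """True if the session contains at least run_length consecutive correct trials."""
    return longest_correct_run(outcomes) >= run_length
```

A plain shuffle can produce long same-side streaks; a stricter pseudorandom design would additionally cap the number of consecutive presentations on one side.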


Open field and marble burying 

Male mice were placed individually in the centre of a square open field 
arena (50 × 50 × 30 cm) made of grey plastic for 7 min. Velocity (cm s−1) 
and time spent in the centre (s) were analysed using the EthoVision 10 system 
(Noldus). The arena was cleaned with 70% ethanol between trials. For 
the marble burying test, animals were placed in a standard type II cage 
with 5 cm bedding containing 20 identical black marbles distributed 
equally for 30 min. A marble was considered buried if at least two-thirds 
of the marble was covered. 


Electrophysiology 

Male mice (P28–P34) were deeply anaesthetized with isoflurane (4% 
in O2, Vapour, Draeger) before decapitation and brain isolation. Acute 
horizontal brain slices (250 µm thick) from the midbrain were cut 
with a vibrating microslicer (Leica VT1200S) in ice-cold oxygenated 
sucrose-based cutting solution containing: NaCl (87 mM), NaHCO3 
(25 mM), KCl (2.5 mM), NaH2PO4 (1.25 mM), sucrose (75 mM), CaCl2 (0.5 mM), 
MgCl2 (7 mM) and glucose (10 mM) (equilibrated with 95% O2, 5% CO2). 
Slices were immediately transferred to a storage chamber filled with 
artificial cerebrospinal fluid (ACSF) containing: NaCl (125 mM), 
NaHCO3 (25 mM), KCl (2.5 mM), NaH2PO4 (1.25 mM), MgCl2 (2 mM), CaCl2 
(2.5 mM) and glucose (11 mM), pH 7.4, constantly bubbled with 95% O2 
and 5% CO2; 315–320 mOsm. Slices were maintained at 35 °C in ACSF 
for 60 min and then kept at room temperature before their transfer to 
the recording chamber. During the recordings, the slices were continuously 
perfused with ACSF at 35.0 ± 2.0 °C. 
Neurons were visualized with a LNScope (Luigs & Neumann) equipped 
with an oblique illumination condenser, a 60x objective (LUMPlanFl, 
NA 0.9) and a reflected illuminator (Olympus). Slices were illuminated 
with a collimated LED infrared light source (Thorlabs) and wLS LED 
illumination unit (Q-imaging). The recorded neurons in the VTA were 
identified by their anatomical localization and recorded if they were 
labelled with red retrobeads (or by morphology for recording from 
non-retrobead-labelled cells). Their dopaminergic identity was subsequently 
confirmed by immunohistochemistry. Neuronal activity 
was recorded at the somata with borosilicate glass pipettes (4–6 MΩ) 
filled with an intracellular solution containing: K-gluconate (125 mM), 
KCl (20 mM), HEPES (10 mM), EGTA (10 mM), MgCl2 (2 mM), Na2ATP 
(2 mM), Na2-phosphocreatine (1 mM), Na3GTP (0.3 mM), 0.2% biocytin, 
pH 7.2 (with KOH); 312.3 mOsm. Electrophysiological recordings were 
obtained using a Multiclamp 700B amplifier and digitized at 10 kHz. 
Recording of spontaneous firing activity in VTA dopaminergic neurons 
was achieved in voltage-clamp cell-attached configuration after 
formation of a gigaseal. 

Baseline activity was recorded during 2 or 3 min (sweeps of Ss dura- 
tion were acquired every 10 s) before oxytocin (1 1M; TOCRIS, 1910) 
applied to the perfusion reached the recording chamber. Recordings 
in the presence of oxytocin were continued for 3 min. At the end of 
these recordings, a whole-cell configuration in voltage clamp (hold- 
ing potential: —-55 mV) was achieved by disruption of the gigaseal with 
gentle negative pressure applied through the recording pipette. A 
period of 5 min was then allowed for loading the neurons with biocy- 
tin. Finally, the recording pipette was carefully removed and the slice 
transferred to paraformaldehyde (4% in PBS) for subsequent anatomical 


analysis. All TH+ cells, including cells negative for retrobeads, were used
to analyse frequency. Only TH+ and retrobead-positive cells recorded
for a minimum of 3 min after OXT application were used to analyse the
OXT response, and all data points represent an average per minute. Ih
recordings were performed in whole-cell voltage-clamp configuration
after compensating the pipette capacitance and the series resistance
(Rs) (15–20 MΩ; 40–50%, bandwidth 3.5 kHz). Rs was monitored online
and the experiments were discarded if Rs changed by >20%.
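
The series-resistance criterion can be written as a simple check. The following is a minimal Python sketch, assuming that the change is measured relative to the first Rs reading of a recording (the text does not state the reference point); the function name and the example readings are hypothetical.

```python
def rs_is_stable(rs_values_mohm, max_change=0.20):
    """Return True if no Rs reading deviates from the first reading by more
    than max_change (fractional). Assumption: the first reading is the
    reference value."""
    baseline = rs_values_mohm[0]
    return all(abs(rs - baseline) / baseline <= max_change
               for rs in rs_values_mohm)


# Hypothetical Rs readings (in megaohms) taken during two recordings
keep_recording = rs_is_stable([18.0, 19.0, 20.0])
discard_recording = not rs_is_stable([18.0, 22.0])
```

A recording for which the function returns False would be discarded under this reading of the criterion.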

Neuronal input resistances (Ri) and membrane capacitances were
evaluated for each neuron by injecting a voltage command of −5 mV
for 200 ms duration, from a holding potential (HP) of −50 mV. The
electrical capacitance of the neurons was determined by fitting a
bi-exponential function to the capacitive current at the beginning of
a −5 mV pulse (pipette capacitance was previously compensated during
the cell-attached configuration prior to whole-cell formation). The
amplitude-weighted time constant (τw) and measured current peak
(Ipeak) amplitude were inserted into the following formula to calculate
the electrical capacitance.


$$C\,[\mathrm{pF}] = \frac{\text{weighted } \tau_{w}\,[\mathrm{ms}] \times I_{\mathrm{peak}}\,[\mathrm{pA}]}{5\,[\mathrm{mV}]}$$


Weighted τw was calculated from the bi-exponential fitting as:


$$\tau_{w} = \frac{(A_{1} \times \tau_{1}) + (A_{2} \times \tau_{2})}{A_{1} + A_{2}}$$

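The two formulas above can be combined in a few lines. This is a minimal Python sketch, not the analysis code used in the study; the function names and the fit values are hypothetical and serve only to show the unit bookkeeping (ms × pA / mV gives pF).

```python
def weighted_tau_ms(a1, tau1_ms, a2, tau2_ms):
    """Amplitude-weighted time constant (ms) of a bi-exponential fit."""
    return (a1 * tau1_ms + a2 * tau2_ms) / (a1 + a2)


def capacitance_pf(tau_w_ms, i_peak_pa, step_mv=5.0):
    """C [pF] = weighted tau [ms] x I_peak [pA] / voltage step [mV]."""
    return tau_w_ms * i_peak_pa / step_mv


# Hypothetical fit parameters (amplitudes in pA, time constants in ms)
tau_w = weighted_tau_ms(a1=300.0, tau1_ms=0.5, a2=100.0, tau2_ms=2.5)
c_m = capacitance_pf(tau_w, i_peak_pa=400.0)
```

With these illustrative numbers the weighted time constant is 1 ms and the capacitance is 80 pF.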

Ih was evoked by injecting a series of 1.5 s duration hyperpolarizing
voltage commands (Vcomm) from a HP of −50 mV to −130 mV, in consecutive
steps of 10 mV. Ih-tail currents were evoked at −130 mV at the end
of each Vcomm before returning the HP to −50 mV. The voltage dependency
of Ih-tail was calculated by plotting the normalized amplitudes. The
maximal Ih-tail current (Imax), defined as that recorded after the Vcomm
to −130 mV, was set to 1; the minimal Ih-tail current (Imin), recorded
after the HP of −50 mV, was set to 0. A Boltzmann function was fitted:
I = Imin + (Imax − Imin)/(1 + exp[(V50 − Vcomm)/S]), in which V50 is the
half-activation potential and S is the slope factor. After recording the
Ih, each cell was switched to current clamp mode to record the resting
membrane potential. The experimenter was blinded to the genotype and
treatment condition.
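
The normalization and the Boltzmann function described above can be sketched as follows. This is an illustrative Python sketch under stated assumptions: the function names and numerical values are hypothetical, and the slope factor is taken as negative so that, with the sign convention of the formula as printed, activation increases with hyperpolarization. The fitting procedure itself is not shown.

```python
import math


def normalize_tail(i_tail, i_min, i_max):
    """Scale a tail-current amplitude so that i_min maps to 0 and i_max to 1."""
    return (i_tail - i_min) / (i_max - i_min)


def boltzmann(v_comm, v50, s, i_min=0.0, i_max=1.0):
    """I = Imin + (Imax - Imin) / (1 + exp((V50 - Vcomm) / S))."""
    return i_min + (i_max - i_min) / (1.0 + math.exp((v50 - v_comm) / s))


# Hypothetical parameters: half-activation at -90 mV, slope factor -8 mV
at_half = boltzmann(-90.0, v50=-90.0, s=-8.0)
at_hyperpolarized = boltzmann(-130.0, v50=-90.0, s=-8.0)
at_holding = boltzmann(-50.0, v50=-90.0, s=-8.0)
```

At Vcomm = V50 the function returns the midpoint between Imin and Imax, which is how V50 is read off a fitted curve.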


Analysis of mRNA translation 

Translation in the VTA was analysed using FUNCAT, in which AHA
is incorporated into cells and then detected using an alkyne tagged
to Alexa 488. Mice were anaesthetized at P28 and 250-µm-thick horizontal
slices were cut on a vibratome and placed in ACSF for 45 min
at 35 °C to recover. The same cutting solution and ACSF as described
for electrophysiology were used. Slices were moved to an incubation
chamber and incubated for an additional 3 h with 1 mM AHA (Jena
Bioscience, CLK-AA005). At the end of incubation, slices were transferred
to ice-cold 4% paraformaldehyde (PFA) and left overnight at 4 °C. The
next day, slices were incubated for 1.5 h in a blocking solution containing
5% BSA (Sigma), 5% normal donkey serum and 0.3% Triton X-100
in 1x PBS with agitation at room temperature. Slices were then washed
3 x 5 min followed by incubation overnight with gentle agitation with
500 µl Click-iT reaction cocktail according to the manufacturer's
instructions (Click-iT Cell Reaction Buffer Kit, Invitrogen, C10269).
Slices were then washed 3 x 10 min with 1x PBS followed by incubation
with anti-TH (Millipore, AB1542, 1:1,000) primary antibody at room
temperature for 2 h, washed three times in 1x PBS, followed by incubation
for 2 h at room temperature with a secondary antibody. The sections were
then washed three times in 1x PBS before being mounted onto microscope
slides with ProLong Gold antifade (Invitrogen, P36930). Images were taken on
an Olympus SpinSR spinning disk with a UPL S APO 30x objective (NA
1.5). All images were taken with the same laser power, gain and exposure
settings, and analysed using ImageJ. For analysis of AHA incorporation,


10 images were taken of the VTA and the Alexa-488 mean fluorescence
intensity of 9–20 regions of interest (ROI) corresponding to TH+ somas
was measured. The mean fluorescence intensity of cells per image was
used for analysis, and the experimenter was blinded to genotype and
treatment. All images within each experiment were processed in parallel
using identical settings in ImageJ (NIH) and Adobe Photoshop CS
8.0 (Adobe Systems).

For puromycin incorporation, 400-µm-thick coronal slices were cut
on a vibratome and placed in ACSF (in mM: 125 NaCl, 2.5 KCl, 1 MgCl₂,
2 CaCl₂, 1.25 NaH₂PO₄, 20 glucose, 26 NaHCO₃, 95% O₂, 5% CO₂) for 30 min
at room temperature followed by 2 h at 32 °C. Puromycin (5 µg µl−1;
Sigma) was added for 45 min to label newly synthesized proteins. Sections
were snap frozen and subsequently lysed in 10 mM HEPES, 1%
SDS, 1 mM NaF, 1 mM sodium vanadate containing protease and phosphatase
inhibitor cocktail (Roche Applied Science), sonicated and incubated
for 10 min at 95 °C. Puromycin incorporation was measured by western
blot (see ‘Western blot and AlphaLISA immunoassay’) using mouse
anti-puromycin antibody (EQ0001, Kerafast). The results were normalized
to signal obtained with anti-calnexin antibodies run at the same
time on a different blot.


Western blot and AlphaLISA immunoassay 

Cortical neurons and brain tissue were homogenized in lysis buffer
containing 137 mM NaCl, 2.7 mM KCl, 10 mM Na₂HPO₄, 1.8 mM KH₂PO₄,
5 mM EDTA, 1% Triton X-100 and complete protease and phosphatase
inhibitors (Roche Applied Science). Immunoblotting was done with
HRP-conjugated secondary antibodies and Pierce ECL Western Blotting
Substrate. The following primary antibodies were used in this study:
p-eIF4E (Abcam, ab76256, 1:1,000), eIF4E (Abcam, ab47482, 1:1,000),
p-ERK1/2 (Cell Signaling, 4370S, 1:1,000), ERK1/2 (Cell Signaling, 4695S,
1:1,000), p-eIF4G (Cell Signaling, 2441S, 1:1,000), eIF4G (Cell Signaling,
2498, 1:1,000), p-MNK1 (Cell Signaling, 2111, 1:1,000), MNK1 (Cell
Signaling, 2195S, 1:1,000), GAPDH (Cell Signaling, 5174, 1:2,000) and
calnexin (Stressgen, SPA-865, 1:2,000). Loading controls were run on
the same gel, and for some experiments Mini PROTEAN TGX Stain-Free
Gels (Bio-Rad) were used as loading controls. Signals were acquired
using an image analyser (Bio-Rad, ChemiDoc MP Imaging System) and
images were analysed and prepared using ImageJ.

For additional measurements of eIF4E phosphorylation state, the
AlphaLISA SureFire Ultra p-eIF4E (Ser209) Assay Kits (PerkinElmer)
were used according to the manufacturer's protocol. AlphaLISA signals
were measured using a Tecan SPARK plate reader on recommended
settings.


Immunohistochemistry and imaging 

Animals were transcardially perfused with fixative (4% paraformaldehyde
in 100 mM phosphate buffer, pH 7.4) at P26–P32. Brains were
post-fixed overnight at 4 °C, incubated in 30% sucrose in 1x PBS for
48 h, and snap frozen on dry ice. Tissues were sectioned at 35 µm on a
cryostat (Microm HM650, Thermo Scientific). Floating sections were
kept in 1x PBS before incubation with blocking solution containing
0.5% Triton X-100 in 1x TBS and 10% normal donkey serum. The slices
were incubated with primary antibody at 4 °C overnight and washed
three times in 1x TBS containing 0.5% Triton X-100, followed by incubation
for 2 h at room temperature with a secondary antibody. The sections
were washed three times in 1x TBS containing 0.5% Triton X-100
before being mounted onto microscope slides with ProLong Gold antifade
(Invitrogen, P36930). For post hoc confirmation of TH-positive cells
after electrophysiology, the sections were fixed in the same fixative
as described above and washed three times with 1x PBS before blocking
and incubation with anti-TH antibody using the same method as above.
The following primary antibodies were used for this study: sheep
anti-TH (Millipore, AB1542, 1:1,000) and mouse anti-neurophysin-1
(Millipore, MABN844, 1:2,000). Secondary antibodies used were: donkey
anti-sheep IgG-Cy3 (713-165-147), donkey anti-sheep Cy5 (713-175-147),


donkey anti-rabbit IgG Cy3 (711-165-152), goat anti-mouse Cy2 (714-225-150),
all from Jackson ImmunoResearch. Streptavidin DyLight 488
(Thermo Scientific 21832, 1 mg ml−1) was used to visualize biocytin.
Hoechst dye was co-applied with the secondary antibody at a final
concentration of 0.5 µg ml−1. Images were acquired on a custom-made
dual spinning disk microscope (Life Imaging Services GmbH) using 10x
and 40x objectives. Images were taken bilaterally along the whole VTA
and PVN dorso-ventral axis and images from at least four (VTA) or five
(PVN) planes were analysed. OXT+ cells in the PVN were counted manually.
Neurophysin-1 area coverage and puncta intensity were measured
using the particle measurement tool in ImageJ on sum projections. All
images within each experiment were processed in parallel with identical
settings using ImageJ and Adobe Photoshop CS 8.0 (Adobe Systems).


FISH 

Mice were deeply anaesthetized using isoflurane inhalation. Brains
were quickly removed and snap frozen on dry ice before storage at
−80 °C. Brains were cut on a cryostat into 10-µm sections, adhered to
Superfrost ultra plus slides (Thermo Scientific) and stored at −80 °C.
Sections were fixed for 30 min in 4% PFA before being processed using
the RNAscope Fluorescent Multiplex Kit (ACD) according to the
manufacturer's instructions. The following probes were used: Oxtr (C3, 412171),
V1ar (C3, 418061), Th (Slc6a3-C2, 315441) and Nlgn3 (C1, 497661). Probes
were combined as Oxtr/Th/Nlgn3 or V1ar/Th/Nlgn3. Amp-4-Alt B was
used for all combinations. Sections were imaged on a custom-made
dual spinning disk microscope (Life Imaging Services GmbH) using a
40x objective, with 12-section z-stacks and 0.2 µm between z-sections.
Images were processed in ImageJ by sum projection of
the z-stacks, followed by analysis of fluorescence intensity and number
of puncta. Cell types were identified based on the presence of Th and
DAPI. An ROI was drawn around the cell to define the area using DAPI,
and only cells with no adjacent DAPI staining were used to avoid false
positives from signals from a second cell. Dots in the ROI were manually
counted and fluorescence intensity was analysed using ImageJ. Images
were assembled with identical settings using ImageJ and Adobe Photoshop
CS 8.0 (Adobe Systems).


Cell culture 

Cortical cultures were prepared from E16.5 mouse embryos. Neocortices
were dissociated by addition of papain (130 units, Worthington
Biochemical LK003176) for 30 min at 37 °C. Cells were maintained in
neurobasal medium (Gibco 21103-049) containing 2% B27 supplement
(Gibco 17504-044), 1% Glutamax (Gibco 35050-038), and 1% penicillin/
streptomycin (Sigma P4333). At DIV14, the cells were treated with either
vehicle (DMSO) or different doses of ETC-168 for 3 h before collection
for western blot.


Proteomic analysis 

VTA tissue was microdissected from coronal sections using anatomical 
landmarks. Dissected tissue was snap frozen in liquid nitrogen. Success- 
ful recovery of proteins from dopaminergic neurons was confirmed 
by quantitative assessment of enrichment of dopaminergic markers 
(dopamine transporter, tyrosine hydroxylase, dopamine decarboxy- 
lase) as compared to other brain regions (Extended Data Fig. 4a). 


Sample preparation for LC-MS analysis. Tissue was washed twice with
PBS and dissolved in 50 µl lysis buffer (1% sodium deoxycholate, 0.1 M
ammonium bicarbonate), reduced with 5 mM TCEP for 15 min at 95 °C
and alkylated with 10 mM chloroacetamide for 30 min at 37 °C. Samples
were digested with trypsin (Promega) at 37 °C overnight (protein to
trypsin ratio: 50:1). An aliquot of a heavy reference peptide mix
containing 10 chemically synthesized proteotypic peptides (Spike-Tides,
JPT) was spiked into each peptide sample at a concentration
of 5 fmol of heavy reference peptides per 1 µg of total endogenous
protein mass. Then, the peptides were cleaned up using iST cartridges
(PreOmics) according to the manufacturer’s instructions. Samples were
dried under vacuum and stored at −80 °C until further use.


Targeted PRM-LC-MS analysis of protein isoforms. In a first step,
parallel reaction-monitoring (PRM) assays were generated from a
mixture containing 100 fmol of each heavy reference peptide and
shotgun data-dependent acquisition (DDA) LC-MS/MS analysis on
a Thermo Orbitrap Fusion Lumos platform (Thermo Fisher Scientific).
The setup of the µRPLC-MS system was as described previously
(Pubmed-ID: 27345528). Chromatographic separation of peptides
was carried out using an EASY nano-LC 1200 system (Thermo Fisher
Scientific), equipped with a heated RP-HPLC column (75 µm x 30 cm)
packed in-house with 1.9 µm C18 resin (Reprosil-AQ Pur, Dr. Maisch).
Peptides were analysed per LC-MS/MS run using a linear gradient
ranging from 95% solvent A (0.15% formic acid, 2% acetonitrile) and 5%
solvent B (98% acetonitrile, 2% water, 0.15% formic acid) to 45% solvent
B over 60 min at a flow rate of 200 nl min−1. Mass spectrometry analysis
was performed on a Thermo Orbitrap Fusion Lumos mass spectrometer
equipped with a nanoelectrospray ion source (both Thermo Fisher
Scientific). Each MS1 scan was followed by high-collision-dissociation
(HCD) of the 10 most abundant precursor ions with dynamic exclusion
for 20 s. Total cycle time was approximately 1 s. For MS1, 1 x 10° ions
were accumulated in the Orbitrap cell over a maximum time of 100 ms
and scanned at a resolution of 120,000 FWHM (at 200 m/z). MS2 scans
were acquired at a target setting of 1 x 10° ions, accumulation time of
50 ms and a resolution of 30,000 FWHM (at 200 m/z). Singly charged
ions and ions with unassigned charge state were excluded from triggering
MS2 events. The normalized collision energy was set to 30%,
the mass isolation window was set to 1.4 m/z and one microscan was
acquired for each spectrum.

The acquired raw files were searched against a mouse database
(Uniprot, download date: 19 April 2017, total of 34,490 entries)
by the MaxQuant software (v.1.0.13.13). The search criteria were set as
follows: full tryptic specificity was required (cleavage after lysine
or arginine residues); three missed cleavages were allowed;
carbamidomethylation (C) was set as fixed modification; Arg10 (R), Lys8
(K) and oxidation (M) as variable modifications. The mass tolerance
was set to 10 ppm for precursor ions and 0.02 Da for fragment ions.
The best six transitions for each peptide were selected automatically
using an in-house software tool and imported to Skyline (v.1;
https://brendanx-uw1.gs.washington.edu/labkey/project/home/software/Skyline/begin.view).
Mass isolation lists containing all selected peptide
ion masses were exported from Skyline, split into eight mass lists by
charge, C-terminal amino acid and transition and precursor ions. The
mass lists were then imported into the Lumos operating software for
SureQuant analysis using the following settings: the resolution of the
orbitrap was set to 30 k (120 k) FWHM (at 200 m/z) for heavy (light)
peptide ions and the fill time was set to 54 (246) ms, respectively, to reach a
target value of 1 x 10° ions. The ion isolation window was set to 0.4 thomson
(Th) and the scan range was set to 150–1,500 Th. The mass window for
triggering heavy PRM scans was set to 10 ppm and the dependent PRM
triggering threshold for the light channel was set to a minimum of two
detected transitions. An MS1 scan using the same conditions as for DDA
was included in each MS cycle. Each condition was analysed in biological
quadruplicates or quintuplicates. All raw files were imported into
Skyline for protein/peptide quantification. To control for variation in
sample amounts, the total ion chromatogram (only comprising peptide
ions with two or more charges) of each sample was determined by
Progenesis QI (v.2.0, Waters) and used for normalization. A summary
of the peptides used is shown in Supplementary Table 1.


Global proteome analysis using shotgun proteomics 

One microgram of peptides of each sample was subjected to LC-MS
analysis using a Q-Exactive HF mass spectrometer connected to
an electrospray ion source (both Thermo Fisher Scientific) as recently
specified and a custom-made column heater set to 60 °C. In brief, peptide
separation was carried out using an EASY nLC-1000 system (Thermo
Fisher Scientific) equipped with a RP-HPLC column (75 µm x 30 cm)
packed in-house with C18 resin (ReproSil-Pur C18-AQ, 1.9 µm resin;
Dr. Maisch GmbH) using a linear gradient from 95% solvent A (0.1%
formic acid, 2% acetonitrile) and 5% solvent B (98% acetonitrile, 0.1%
formic acid) to 45% solvent B over 60 min at a flow rate of 0.2 µl min−1.
A total of 3 x 10° and 1 x 10° ions were accumulated for MS1 and MS2,
respectively, and scanned at a resolution of 60,000 and 15,000 FWHM
(at 200 m/z). Fill time was set to 110 ms for MS1 and 50 ms for MS2
scans. For MS2, a normalized collision energy of 28% was used, the
ion isolation window was set to 1.4 Th and the first mass was fixed to
100 Th. To determine changes in protein expression across samples,
an MS1-based label-free quantification was carried out. To this end, the
generated raw files were imported into the Progenesis QI software
(Nonlinear Dynamics, v.2.0) and analysed using the default parameter
settings. MS/MS data were exported directly from Progenesis QI
in mgf format and searched against a decoy database of the forward
and reverse sequences of the predicted proteome from Mus musculus
(Uniprot, download date: 18 April 2017, total of 34,490 entries) using
MASCOT (v.2.4.1). The search criteria were set as follows: full tryptic
specificity was required (cleavage after lysine or arginine residues);
three missed cleavages were allowed; carbamidomethylation (C) was
set as fixed modification; oxidation (M) as variable modification. The
mass tolerance was set to 10 ppm for precursor ions and 0.02 Da for
fragment ions. Results from the database search were imported into
Progenesis QI and the final peptide measurement list containing the
peak areas of all identified peptides was exported. This
list was further processed and statistically analysed using our in-house
developed SafeQuant R script (SafeQuant; https://github.com/eahrne/SafeQuant).
The peptide and protein false discovery rate was set to
1% using the number of reverse hits in the data set. All quantitative
analyses were performed in biological quintuplicates. Proteins with 
less than 1 peptide were excluded from the analysis. 
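
The idea of setting a false discovery rate from reverse (decoy) hits can be illustrated with a generic target-decoy estimate. The Python sketch below is an assumption-laden illustration, not the SafeQuant or MASCOT implementation: it estimates the FDR at a score cutoff as the number of decoy hits divided by the number of target hits at or above that cutoff, and the function name and scores are hypothetical.

```python
def score_threshold_for_fdr(hits, max_fdr=0.01):
    """hits: iterable of (score, is_decoy) pairs.

    Walk down the score-ranked list and return the lowest score at which
    the estimated FDR (decoys / targets accepted so far) is still
    <= max_fdr. Returns None if no cutoff satisfies the criterion.
    """
    ranked = sorted(hits, key=lambda h: h[0], reverse=True)
    targets = 0
    decoys = 0
    best = None
    for score, is_decoy in ranked:
        if is_decoy:
            decoys += 1
        else:
            targets += 1
        if targets > 0 and decoys / targets <= max_fdr:
            best = score
    return best


# Hypothetical search result: 100 target hits and 2 reverse (decoy) hits
example_hits = [(float(s), False) for s in range(100, 0, -1)]
example_hits += [(50.5, True), (10.5, True)]
cutoff = score_threshold_for_fdr(example_hits)
```

In this toy example every hit scoring at or above the returned cutoff would be accepted at an estimated FDR of at most 1%.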

Detailed results of the proteomics experiments, including
identification scores, number of peptides quantified, normalized
(by sum of all peak intensities) peak intensities, log2-transformed ratios,
coefficients of variation and P values for each quantified protein,
are summarized in Supplementary Table 2.


Global proteome analysis using tandem mass tags 
Sample aliquots (prepared as described above) containing 10 µg of
peptides were dried and labelled with tandem mass isobaric tags (TMTpro
16-plex, Thermo Fisher Scientific) according to the manufacturer’s
instructions. To control for ratio distortion during quantification, a
peptide calibration mixture consisting of six digested standard proteins
mixed in different amounts was added to each sample before
TMT labelling as recently described. After pooling the differentially
TMT-labelled peptide samples, peptides were again desalted on
C18 reversed-phase spin columns according to the manufacturer’s
instructions (Macrospin, Harvard Apparatus) and dried under vacuum.
TMT-labelled peptides were fractionated by high-pH reversed-phase
separation using an XBridge Peptide BEH C18 column (3.5 µm, 130 Å,
1 mm x 150 mm, Waters) on an Agilent 1260 Infinity HPLC system. Peptides
were loaded on the column in buffer A (ammonium formate (20 mM,
pH 10) in water) and eluted using a two-step linear gradient starting
from 2% to 10% in 5 min and then to 50% (v/v) buffer B (90% acetonitrile/10%
ammonium formate (20 mM, pH 10)) over 55 min at a flow rate
of 42 µl min−1. Elution of peptides was monitored with a UV detector
(215 nm, 254 nm). A total of 36 fractions were collected, pooled into 12
fractions using a post-concatenation strategy as previously described and
dried under vacuum.
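
One common way to pool 36 fractions into 12 by concatenation is to combine every twelfth fraction, so that each pool spans the whole gradient. The Python sketch below illustrates that scheme; it is an assumption about the pooling pattern, since the text only cites the strategy, and the function name is hypothetical.

```python
def concatenate_fractions(n_fractions=36, n_pools=12):
    """Assign fractions 1..n_fractions to pools so that pool k receives
    fractions k, k + n_pools, k + 2 * n_pools, and so on."""
    pools = [[] for _ in range(n_pools)]
    for fraction in range(1, n_fractions + 1):
        pools[(fraction - 1) % n_pools].append(fraction)
    return pools


pools = concatenate_fractions()
first_pool = pools[0]    # fractions 1, 13 and 25
last_pool = pools[-1]    # fractions 12, 24 and 36
```

Combining early, middle and late fractions in each pool keeps the pooled samples chromatographically distinct from one another in the subsequent low-pH separation.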

Aliquots of 1 µg of peptides were analysed by LC-MS as previously
described. Chromatographic separation of peptides was carried
out using an EASY nano-LC 1000 system (Thermo Fisher Scientific),
equipped with a heated RP-HPLC column (75 µm x 37 cm) packed
in-house with 1.9 µm C18 resin (Reprosil-AQ Pur, Dr. Maisch). Aliquots
of 1 µg total peptides were analysed per LC-MS/MS run using a linear
gradient ranging from 95% solvent A (0.15% formic acid, 2% acetonitrile)
and 5% solvent B (98% acetonitrile, 2% water, 0.15% formic acid) to 30%
solvent B over 90 min at a flow rate of 200 nl min−1. Mass spectrometry
analysis was performed on a Q-Exactive HF mass spectrometer equipped
with a nanoelectrospray ion source (both Thermo Fisher Scientific).
Each MS1 scan was followed by high-collision-dissociation (HCD)
of the 10 most abundant precursor ions with dynamic exclusion for
20 s. Total cycle time was approximately 1 s. For MS1, 3 x 10° ions were
accumulated in the Orbitrap cell over a maximum time of 100 ms and
scanned at a resolution of 120,000 FWHM (at 200 m/z). MS2 scans
were acquired at a target setting of 1 x 10° ions, accumulation time of
100 ms and a resolution of 30,000 FWHM (at 200 m/z). Singly charged
ions and ions with unassigned charge state were excluded from triggering
MS2 events. The normalized collision energy was set to 35%,
the mass isolation window was set to 1.1 m/z and one microscan was
acquired for each spectrum.

The acquired raw-files were searched against a protein data- 
base containing sequences of the predicted SwissProt entries of 
Mus musculus (www.ebi.ac.uk, release date 27 March 2019), the six 
calibration mix proteins and commonly observed contaminants (in 
total 17,412 sequences) using the SpectroMine software (Biognosys, 
v.1.0.20235.13.16424) and the TMT 16-plex default settings. In brief, the 
precursor ion tolerance was set to 10 ppm and fragment ion tolerance 
was set to 0.02 Da. The search criteria were set as follows: full tryptic 
specificity was required (cleavage after lysine or arginine residues 
unless followed by proline), three missed cleavages were allowed,
carbamidomethylation (C) and TMTpro (K and peptide N-terminus) were set as
fixed modifications and oxidation (M) as a variable modification. The
false identification rate was set to 1% by the software based on the num- 
ber of decoy hits. Proteins that contained similar peptides and could 
not be differentiated based on MS/MS analysis alone were grouped 
to satisfy the principles of parsimony. Proteins sharing significant 
peptide evidence were grouped into clusters. Acquired reporter ion
intensities in the experiments were employed for automated quantification
and statistical analysis using a modified version of our in-house
developed SafeQuant R script (v.2.3). This analysis included adjustment
of reporter ion intensities, global data normalization by equalizing
the total reporter ion intensity across all channels, summation of
reporter ion intensities per protein and channel, calculation of protein 
abundance ratios and testing for differential abundance using empiri- 
cal Bayes moderated t-statistics. Finally, the calculated P values were 
corrected for multiple testing using the Benjamini-Hochberg method. 
A summary is shown in Supplementary Table 3.
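
Two of the steps named above, equalizing the total reporter ion intensity across channels and the Benjamini–Hochberg correction, can be sketched in plain Python. This is an illustrative sketch and not the SafeQuant code: the function names and data are hypothetical, and scaling each channel to the mean channel total is one possible choice of common target.

```python
def equalize_channel_totals(intensities):
    """intensities: dict mapping channel name -> list of reporter ion
    intensities. Each channel is rescaled so that its summed intensity
    equals the mean of the channel totals (assumed target)."""
    totals = {ch: sum(vals) for ch, vals in intensities.items()}
    target = sum(totals.values()) / len(totals)
    return {ch: [x * target / totals[ch] for x in vals]
            for ch, vals in intensities.items()}


def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted P values in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical data for two channels and four proteins
normalized = equalize_channel_totals({"126": [1.0, 3.0], "127N": [2.0, 6.0]})
q_values = benjamini_hochberg([0.01, 0.04, 0.03, 0.005])
```

After normalization both example channels sum to the same total, and the adjusted P values can be compared directly against the chosen significance threshold.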


GO, network and gene set enrichment analysis 

GO analysis was performed using the DAVID classification system
(https://david.ncifcrf.gov). Proteins significantly different between
vehicle-treated wild type and Nlgn3KO (P < 0.05) were compared to all proteins
detected in the proteomic screen using the GO GOTERM_BP_DIRECT
annotation dataset with the minimum number of hits set to five and the
maximum P value threshold to 0.05 with Benjamini correction (P < 0.05).
Network analysis was obtained using the STRING v11 database. Each node
represents a protein altered in Nlgn3KO VTA compared to wild-type
VTA in vehicle-treated conditions, and each edge shows a protein–protein
interaction as determined by experiments and databases. The
highest confidence (0.900) was used for interaction scores. Disconnected
nodes and networks containing more than six proteins were
removed. Gene set enrichment analysis for TMT proteomic data was
performed using all proteins detected with at least two peptides with
PSEA-Quant. The list of GO terms with Q < 0.01 was summarized
using REVIGO with small (0.5) allowed similarity, and displayed using
Cytoscape.


Statistical analysis 

No statistical methods were used to predetermine sample size. The
animals were randomly assigned to each group at the moment of drug
treatment, and a minimum of three independent cohorts were used
for behavioural experiments. Statistical analysis was conducted with
GraphPad Prism 8. The normality of sample distributions was assessed
and, when violated, non-parametric tests were used. When normally
distributed, the data were analysed with unpaired t-test for comparison
between two groups, while for multiple comparisons one-way ANOVA
and repeated-measures ANOVA were used. For the analysis of variance
with two factors (two-way ANOVA, repeated-measures two-way ANOVA
and repeated-measures two-way ANOVA by both factors), normality of
sample distribution was assumed, and the analysis was followed by a
Bonferroni post hoc test. Differences in frequency distribution were
assessed using the Kolmogorov–Smirnov test. All the statistical tests
adopted were two-sided. When comparing two sample distributions,
similarity of variances was assumed; therefore no corrections were
adopted. Outliers were identified using the ROUT test on the most
stringent setting (Q = 0.1%). Data are represented as the mean ± s.e.m.
and the significance was set at P < 0.05.


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


Raw mass spectrometry data associated with the manuscript have been
deposited to the ProteomeXchange Consortium via the PRIDE
partner repository with the dataset identifier PXD018808 and
10.6019/PXD018808. All renewable reagents and detailed protocols will be
made available on request. Source data are provided with this paper.

Extended Data Fig. 1 | Loss of social recognition in Nlgn3KO mice. a, b, Mean
social interaction time and data for individual mice in the social habituation/
[…] c, d, Mean social interaction time and data for individual mice plotted for
DAT-cre control mice (n = 10) (c) and DAT-cre::Nlgn3 mice (n = 11) (d).
e, Example for validation of targeted gene knockdown (n = 8 mice) from
[…] SNc of DAT-cre mice that express GFP from the AAV2-DIO-miR-GFP vector
(n = 8). g, h, Mean social interaction time and data for individual mice plotted
for control mice (g, VTA::DA-miR, n = 10) and VTA DA-specific Nlgn3
[…] test. i, j, Mean social interaction time and data for individual mice plotted for
mice treated with vehicle (n = 12) (i) and OXTR-A (n = 11) (j). All error bars are
s.e.m. Repeated-measures one-way ANOVA followed by Bonferroni’s post hoc
[…] information for additional statistics.

Extended Data Fig. 2 | Properties of NAc-projecting VTA DA neurons in
wild-type and Nlgn3KO mice. a, Representative Ih currents recorded from
wild-type (black) and Nlgn3KO (blue) neurons evoked by consecutive
hyperpolarizing voltage steps of −10 mV from −50 to −130 mV (bottom). At the
end of each voltage step, the voltage command was returned to −130 mV to
evoke tail currents (Ih-tail, depicted with an arrowhead). Red lines show fit of a
single exponential function used to assess the Ih activation kinetics.
b, Averaged Ih amplitudes were plotted against the voltage step. Ih current
amplitudes were measured at the steady state (indicated with a filled circle in a)
and the leak current values, defined as the amplitude of the instantaneous
currents at the onset of voltage steps (indicated with an asterisk in a), were
subtracted. c, Voltage dependency of Ih-tail currents. Ih-tail amplitudes were
normalized relative to Ih-tail at −50 mV and −130 mV. Solid lines show fits with a
Boltzmann function for least square fit. The P value shows the difference in V50
between datasets. d, Activation kinetics of the Ih as determined by the τ values
obtained from the exponential fitting, as a function of the command
voltage. Only the values obtained from commands between −130 mV and −90 mV
were evaluated. e, Comparison between groups of the resting membrane
potential as assessed with current clamp recordings. f, Membrane capacitance
(Cm) of wild-type and Nlgn3KO DA neurons. g, Input resistance values (Ri) for
wild-type and Nlgn3KO mice. Cm and Ri values were obtained in voltage clamp
mode by applying a −5 mV (200 ms) voltage command from a holding potential
set at −50 mV. All error bars are s.e.m. The P value represents genotype
differences. n = 5 mice per genotype, numbers on graphs represent cells.
P values determined by repeated-measures two-way ANOVA (b), Boltzmann
sigmoidal test (c), mixed-effects model (d), unpaired two-sided t-test (e, f), or
two-sided Mann–Whitney test (g). See Supplementary Information for
additional statistics.

Extended Data Fig. 3 | Oxytocinergic innervation to VTA and Avpr1a mRNA
levels are not affected in Nlgn3KO mice. a, Representative images from 3 mice
per genotype of neurophysin 1 (green), a cleavage product of the oxytocin
neuropeptide precursor that is transported in vesicles together with
oxytocin, and TH (red) immunofluorescence in the VTA of wild-type and
Nlgn3KO mice. Note that oxytocinergic axons arise from multiple hypothalamic
nuclei, including the paraventricular nucleus. b, c, Mean VTA area coverage (b)
and puncta fluorescence in the VTA (c) from wild-type and Nlgn3KO mice. n = 3
mice per genotype. Numbers in brackets represent sections. d, Quantification
of mean Th intensity per TH+ cell. e, Quantification of Nlgn3 puncta per 100 µm²
TH+ cell. f, Quantification of Oxtr puncta per 100 µm² TH+ cell. n = wild type:
280 cells from 4 mice; Nlgn3KO: 265 cells from 3 mice (d–f). g, Targeted
proteomic (PRM) measurements for oxytocin receptor (OXTR; left) and AVPR1A
(right) proteins in VTA. Numbers on bars indicate mice. h, Representative
images of FISH labelling of Avpr1a (cyan), Th (red) and Nlgn3 (green) in the
VTA from wild-type and Nlgn3KO mice. Experiment was repeated independently twice.
i, Quantification of mean Avpr1a intensity per TH+ cell. j, Quantification of
Avpr1a puncta per 100 µm² TH+ cell from wild-type and Nlgn3KO VTA. n = wild
type: 169 cells from 4 animals; Nlgn3KO: 200 cells from 3 mice (i, j). All error bars
are s.e.m. P values determined by unpaired two-sided t-test (b, c, g), two-sided
Mann–Whitney U test (d–f, i, j). See Supplementary Information for additional
statistics.

Extended Data Fig. 4 | Ribosomal proteins and translation processes are
altered in Nlgn3KO mice. a, TMT proteomics: graphs plotting abundance of
dopamine markers and synaptic proteins from VTA, cortex and hippocampus.
Dopaminergic markers are strongly enriched in VTA samples. n = 5 mice per
brain region. b, c, Proteomic analysis of wild-type and Nlgn3KO VTA, n = 5 mice
per genotype. b, Enrichment of GO terms for biological processes for proteins
significantly altered (P < 0.05) in Nlgn3KO mice compared to wild-type mice.
c, Network-based analysis of proteins altered in Nlgn3KO VTA (P < 0.01). Blue
nodes indicate downregulated proteins, red nodes upregulated proteins, light
blue lines indicate interactions known from database and purple lines
interactions experimentally determined. Disconnected nodes and nodes
containing less than six proteins are not shown. See Methods for additional
information on statistics and analysis parameters. d, Mean puromycin
incorporation in acute cortical slices from adult wild-type and Nlgn3KO mice. All
error bars are s.e.m. P values determined by two-sided Mann–Whitney U test.
See Supplementary information for additional statistics. 
Extended Data Fig. 5 | Pharmacological profile of novel MNK1/2 inhibitor
ETC-168. a, b, Quantification of p-ERK1/2 (a) and p-eIF4G (b) levels compared
to non-phosphorylated protein in cortical neurons at DIV14 treated with ETC-168.
n = 8 replicates from 3 independent experiments. c-e, Quantification of
eIF4E (c), ERK1/2 (d) and eIF4G (e) levels normalized to calnexin in cortical
neurons at DIV14 treated with ETC-168. P values determined by one-way
ANOVA. n = 8 replicates from 3 independent experiments. f, Representative
western blot of cerebellar lysate from wild-type mice treated with vehicle or
ETC-168 and quantification of p-eIF4E levels compared to eIF4E. n = vehicle: 4;
1 mg kg−1: 3; 5 mg kg−1: 6. g, Representative western blot of VTA lysate from
wild-type and Nlgn3KO mice (n = 7 mice per genotype). h, Quantification of
p-eIF4E compared with eIF4E levels in VTA of wild-type mice and Nlgn3KO mice.
n = 7 mice per genotype. i, Normalized p-eIF4E AlphaLisa counts from wild-type
and Nlgn3KO VTA lysate. n = 7 mice per genotype. j, k, Representative western
blot (j) and quantification (k) of p-MNK1 and MNK1 levels in VTA lysate from
wild-type mice and Nlgn3KO mice. n = 7 mice per genotype. l, Normalized
p-eIF4E AlphaLisa counts from VTA from wild-type treated with 5 mg kg−1 ETC-168
for 24 h + 2 h. n = 5 mice per genotype. All error bars are s.e.m. P values
determined by one-way ANOVA (a, c-f), Kruskal-Wallis test (b), unpaired
two-sided t-test (h, k), or two-sided Mann-Whitney test (i, l).
See Supplementary Information for additional statistics.
c, Treatment schedule of Fmr1l” and Fmr1*™ mice. Mice were treated daily [...] post hoc test. See Supplementary Information for additional statistics.
proteomic-normalized protein expression of dopaminergic markers in VTA
from wild-type and Nlgn3KO mice treated with vehicle or ETC-168. Mice were
Extended Data Fig. 8 | Effect of short-term ETC-168 treatment on social
recognition in wild-type and Nlgn3KO mice. a-d, Time course of time
interacting in the social habituation/recognition test for mice after short-term
treatment with ETC-168. a, Wild-type mice treated with vehicle (n = 12).
b, Wild-type mice treated with ETC-168 (n = 10). c, Nlgn3KO mice treated with
vehicle (n = 9). d, Nlgn3KO mice treated with ETC-168 (n = 11). Error bars report
s.e.m. P values determined by Friedman's test followed by Dunn's post hoc test
for planned multiple comparison (a, c, d) or repeated-measures one-way
ANOVA followed by Bonferroni's post hoc test for planned multiple comparison
(b). See Supplementary Information for additional statistics.
Extended Data Fig. 9 | Effect of ETC-168 treatment is dependent on the
oxytocin receptor. a, Experimental outline. b, Mean social interaction time in
Nlgn3KO mice treated with 5 mg kg−1 ETC-168 and either vehicle or 10 mg kg−1 [...]
Numbers on graph indicate mice. d, e, Individual values and mean of time [...]
168 plus vehicle (n = 9) (d), or ETC-168 plus OXTR-A (n = 8) (e). Error bars report
s.e.m. P values determined by repeated-measures two-way ANOVA followed by
Bonferroni's post hoc test (b), unpaired two-sided t-test (c), [...]
additional statistics.
Extended Data Fig. 10 | Effect of long-term ETC-168 treatment on behaviour
in wild-type and Nlgn3KO mice. a, Experimental schematics of chronic ETC-168
treatment and behaviour schedule. Number of animals per treatment
condition for all behaviours in b-p: wild-type vehicle = 9, wild-type ETC-168 = 10,
Nlgn3KO vehicle = 8, Nlgn3KO ETC-168 = 9. b, c, Mean social interaction
time in wild-type (b) and Nlgn3KO (c) mice treated for 8 days with vehicle or
5 mg kg−1 ETC-168. d, Social recognition index for wild-type and Nlgn3KO mice
treated with vehicle or 5 mg kg−1 ETC-168. Numbers in brackets indicate mice.
e-h, Individual values and mean time interacting in the social habituation/
recognition test after chronic treatment with ETC-168 for wild-type vehicle
(n = 9) (e), wild-type ETC-168 (n = 10) (f), Nlgn3KO vehicle (n = 8) (g), and Nlgn3KO
ETC-168 (n = 9) (h). i, Experimental schematics of object habituation/
recognition test in juvenile mice. j-l, Mean object interaction time plotted for
wild-type (j) and Nlgn3KO (k) mice. P values above graphs report trial. l, Object
recognition index. m, Mean velocity (cm s−1) in an open field arena during 7 min.
n, Time spent in centre of the open field arena. o, Number of marbles buried
during a 30 min marble burying test. p, Percentage weight gain in wild-type and
Nlgn3KO mice treated with ETC-168 or vehicle. P value for treatment is displayed
on graphs. Error bars report s.e.m. Repeated-measures two-way ANOVA
followed by Bonferroni's post hoc test for genotype and treatment (b, c, j, k);
repeated-measures two-way ANOVA followed by Bonferroni's post hoc test for
genotype and treatment (d, l, o), Friedman test followed by Dunn's post hoc
test for planned multiple comparison (e, f, h), one-way ANOVA followed by
Bonferroni's post hoc test for planned multiple comparison (g), or
repeated-measures two-way ANOVA (m, n, p). See Supplementary Information
for additional statistics.
ERK1/2: Cell Signaling, 4695S, clone: D13.14.4E, Lot#14
p-eIF4G: Cell Signaling, 2441S, Lot#4
eIF4G: Cell Signaling, 2498, Lot#4
p-MNK1: Cell Signaling, 2111, Lot#6
MNK1: Cell Signaling, 2195S, Lot#5
GAPDH: Cell Signaling, 5174, clone: D16H11, Lot#7
calnexin: Stressgen, SPA-865, Lot#01031711
puromycin: Kerafast, EQ0001, clone: 3RH11, Lot#2733311
TH: Millipore, AB1542, Lot#2982635
neurophysin-1: Millipore, MABN844, clone: PS38, Lot#3083532


donkey anti-sheep IgG-Cy3 Jackson ImmunoResearch 713-165-147 
donkey anti-sheep Cy5 Jackson ImmunoResearch 713-175-147 
donkey anti-rabbit IgG Cy3 Jackson ImmunoResearch 711-165-152 
goat anti-mouse Cy2 Jackson ImmunoResearch 714-225-150 


Validation p-eIF4E: WB: validated using positive control 293 cell lysate treated with alkaline phosphatase and HEK293 cell lysate treated
with dexamethasone (abcam.com).
eIF4E: evaluated by western blotting using breast carcinoma tissue and NIH/3T3 cell extract (abcam.com)
p-ERK1/2: evaluated by western blotting using extracts from COS cells, untreated or treated with either U0126 #9903 (10 uM for
1h) or TPA (cellsignal.com)
ERK1/2: evaluated by western blotting using extracts from HeLa, NIH/3T3 and C6 cells (cellsignal.com)
p-eIF4G: evaluated by western blotting using extracts from 293 cells expressing GST-eIF4GI Ser1192Ala or GST-eIF4GI
Ser1108Ala mutant protein (cellsignal.com)
eIF4G: evaluated by western blotting using extracts from various cell lines (cellsignal.com)
p-MNK1: evaluated by western blotting in NIH/3T3 cells serum starved for 24h and then treated or untreated with serum for 30
min (cellsignal.com)
MNK1: evaluated by western blot analysis of extracts from various cell lines (cellsignal.com)
GAPDH: evaluated by western blotting using extracts from various cell lines (cellsignal.com)
calnexin: evaluated by western blotting using MWM, Vero, 3T3, PC-12, and HeLa cell lines (enzolifesciences.com)
puromycin: evaluated for western blotting (Kelleher, AR., et al. (2013). Am J Physiol Endocrinol Metab. 304(2):E229-236).
TH: evaluated by western blot on mouse brain lysates (merckmillipore.com)
neurophysin-1: evaluated for IHC (Ben-Barak, Y., et al. (1985). J. Neurosci. 5(1):81-97)
Laboratory animals Mice:
C57BL/6J, male, P5-adult
NMRI, male, female, E16.5 
Wild animals the study did not involve wild animals 
Field-collected samples the study did not involve field-collected samples 
Ethics oversight Basel Cantonal Veterinary Office Committees for Animal Experimentation 


Lausanne Cantonal Veterinary Office Committees for Animal Experimentation 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 
Following the detection of the new coronavirus severe acute respiratory syndrome
coronavirus 2 (SARS-CoV-2) and its spread outside of China, Europe has experienced
large epidemics of coronavirus disease 2019 (COVID-19). In response, many European
countries have implemented non-pharmaceutical interventions, such as the closure
of schools and national lockdowns. Here we study the effect of major interventions
across 11 European countries for the period from the start of the COVID-19 epidemics
in February 2020 until 4 May 2020, when lockdowns started to be lifted. Our model
calculates backwards from observed deaths to estimate transmission that occurred
several weeks previously, allowing for the time lag between infection and death. We
use partial pooling of information between countries, with both individual and shared
effects on the time-varying reproduction number (R_t). Pooling allows for more
information to be used, helps to overcome idiosyncrasies in the data and enables
more-timely estimates. Our model relies on fixed estimates of some epidemiological
parameters (such as the infection fatality rate), does not include importation or
subnational variation and assumes that changes in R_t are an immediate response to
interventions rather than gradual changes in behaviour. Amidst the ongoing
pandemic, we rely on death data that are incomplete, show systematic biases in
reporting and are subject to future consolidation. We estimate that, for all of the
countries we consider here, current interventions have been sufficient to drive R_t
below 1 (probability R_t < 1.0 is greater than 99%) and achieve control of the epidemic.
We estimate that across all 11 countries combined, between 12 and 15 million
individuals were infected with SARS-CoV-2 up to 4 May 2020, representing between
3.2% and 4.0% of the population. Our results show that major non-pharmaceutical
interventions, and lockdowns in particular, have had a large effect on reducing
transmission. Continued intervention should be considered to keep transmission of
SARS-CoV-2 under control.


Following the identification of the new coronavirus SARS-CoV-2 in 
Wuhan (China) in December 2019 and its global spread, large epidemics 
of COVID-19 have ensued in Europe. In response to the rising numbers of 
cases and deaths and to preserve health systems, European countries— 
as with those in Asia—have implemented measures to control their 
epidemics. These large-scale non-pharmaceutical interventions vary 
between countries, but include social distancing (such as banning large 
gatherings), border closures, school closures, measures to isolate symp- 
tomatic individuals and their contacts, and large-scale lockdowns of 
populations with all but essential internal travel banned. Understanding 
whether these interventions have had the desired effect of controlling
the epidemic, and which interventions are necessary to maintain control,
is critical given their large economic and social costs. The key aim
of these interventions is to reduce R_t, a fundamental epidemiological
quantity that represents the average number of infections generated at
time t by each infected case over the course of their infection.

In China, strict movement restrictions and other measures (includ- 
ing case isolation and quarantine) began to be introduced from 23 
January 2020, which achieved a downward trend in the number of 
confirmed new cases during February and resulted in zero new con- 
firmed indigenous cases in Wuhan by 19 March 2020. Studies have 
estimated how the values of R_t changed during this time in different
areas of China, from around 2-4 during the uncontrolled epidemic
to below 1 (refs. ?”). 

Estimating R_t for SARS-CoV-2 presents challenges, owing to the high
proportion of infections that are not detected by health systems
and to the regular changes in testing policies, which resulted in differ- 
ent proportions of infections being detected over time and between 
countries. Initially, most countries had the capacity to test only a small 
proportion of suspected cases and reserved tests for severely ill patients 
or for high-risk groups (for example, the contacts of positively tested 
individuals). 

An alternative way to estimate the course of the epidemic is to cal- 
culate backwards from observed deaths to the number of infections. 
We introduce a Bayesian mechanistic model linking the infection cycle 
to observed deaths, inferring the total population infected (attack 
rates) as well as R_t. We assess whether there is evidence that interventions
have so far been successful at reducing R_t to values below 1. We
simulate a hypothetical counterfactual scenario in which R_t remains at
starting levels to estimate the deaths that would have occurred without 
interventions. 

Reported deaths are likely to be far more reliable than case data— 
although reported death data still have limitations. First, early deaths 
attributable to COVID-19 may have been missed. Second, there is 
variation in the reporting of deaths by country and over time. Third, 
reporting delays are expected and can be both systematic and random 
in nature. We attempt to overcome these data limitations by using a 
consolidated data source, incorporating noise in our observational 
model, partial pooling of information between countries and performing
a sensitivity analysis under scenarios of underreporting to
test our conclusions (Supplementary Information). 

Our model relies on fixed estimates of some epidemiological parameters,
such as the onset-to-death distribution, the infection fatality rate
and the generation distribution, that are based on previous work; we
perform a sensitivity analysis on these parameters. Our parametric
form of R_t assumes that changes in R_t are an immediate response to
interventions rather than gradual changes in behaviour, and it does 
not include importation or subnational variation. We assume that 
individual interventions have a similar effect in different countries, 
and that the efficacy of these interventions remains constant over 
time. Our framework infers R_t from mortality data, while accounting
for time lags since infections occurred. As a result, even with perfect
data and partial pooling, we cannot perfectly predict the current value
of R_t. However, the credible intervals on R_t show the self-consistent
behaviour that is a hallmark of a fully Bayesian analysis throughout 
the entire period we study, exhibiting appropriate shrinkage as more 
data become available (Supplementary Videos 1-3). 

Italy was the first European country to begin major 
non-pharmaceutical interventions, and other countries followed soon 
afterwards (Extended Data Fig. 4). The onset of interventions ranged 
between 2 March and 29 March 2020. We analysed data on mortality 
from COVID-19 in 11 European countries until 4 May 2020, at which 
point lockdowns were relaxed in Italy and Spain. For each country, we 
model the number of infections, the number of deaths and R_t (Fig. 1).
R_t is modelled as a piecewise constant function that changes only when
an intervention occurs. Each country has its own individual starting R_t
before interventions took place. For all countries, interventions are
assumed to have the same relative effect on R_t and are informed by
mortality data across all countries. The only exception is that we use
partial pooling to introduce country-specific effects of the effectiveness
of the last intervention introduced in the study period in a country
(which is usually lockdown). 
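
The piecewise constant form described above can be illustrated with a minimal sketch. It assumes an exponential link between intervention effect sizes and R_t, which is not stated in this section; the function name, day indices and effect sizes are hypothetical and chosen only for illustration.

```python
import math

def piecewise_rt(r0, start_days, effects, n_days):
    """Return a list of R_t values for days 0..n_days-1.

    r0         -- starting reproduction number before any intervention
    start_days -- day index on which each intervention begins (hypothetical)
    effects    -- non-negative effect size for each intervention (hypothetical)

    Assumption: effects of active interventions add up inside an
    exponential, so R_t only changes on days when an intervention starts.
    """
    rt = []
    for day in range(n_days):
        # Sum the effect sizes of all interventions already in force.
        active = sum(a for s, a in zip(start_days, effects) if day >= s)
        rt.append(r0 * math.exp(-active))
    return rt

# Illustrative values only: two interventions starting on days 10 and 20.
rt = piecewise_rt(r0=3.8, start_days=[10, 20], effects=[0.2, 1.5], n_days=30)
```

Between intervention dates the returned values are flat, which is the property the text relies on: the curve steps down only when a new intervention begins.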


Estimated infections, R_t and effect sizes


In all countries, we estimate there are orders-of-magnitude fewer infections
detected (Fig. 1, Extended Data Figs. 1, 2) than true infections, most
likely owing to mild and asymptomatic infections as well as limited
testing capacities and changes in testing policy. In Italy, our results 
suggest that—cumulatively—2.8 (2.2-3.5) million people (all parentheti- 
cal ranges refer to 95% credible intervals) have been infected as of 4 
May 2020, which gives an attack rate of 4.6% (3.6-5.8%) of the popula- 
tion (Table 1). In Spain (which has also experienced a large number of 
deaths), we estimate that 5.5% of the population (2.6 (2.1-3.3) million 
people) have been infected to date. Germany, the most populous coun- 
try in our study, is estimated to have one of the lowest attack rates at 
0.85% with 710,000 (550,000-930,000) people infected. Belgium has 
the highest estimated attack rate at 8%, followed by Spain with 5.5%.
Although there have been few reliable national serological studies,
initial small-scale surveys in Austria and Denmark closely align with
our estimates. A much larger study in Spain is very closely aligned with
our estimates. To some extent, these initial results validate our choice
of infection fatality rate. 
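
As a quick consistency check, the combined attack rate quoted in the abstract (12 to 15 million infections, or 3.2% to 4.0%) can be recomputed from the total population of 375 million given in the Table 1 footnote. This is plain arithmetic rather than part of the model; the function name is hypothetical.

```python
def attack_rate_percent(infected, population):
    """Attack rate: percentage of the population infected so far."""
    return 100.0 * infected / population

# Figures taken from the text: 12-15 million infections, 375 million people.
TOTAL_POPULATION = 375e6
low = attack_rate_percent(12e6, TOTAL_POPULATION)
high = attack_rate_percent(15e6, TOTAL_POPULATION)
print(f"{low:.1f}% to {high:.1f}%")
```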

Averaged across all countries, we estimate the initial R_t to be 3.8 (2.4-
5.6), consistent with previous analyses. These estimates are informed
by our choice of generation-interval distribution and the initial growth
rate of observed deaths. A shorter assumed generation time results in
lower starting R_t (Supplementary Discussion 3). The initial values of R_t are
also uncertain, owing to (a) importation (rather than local transmission) 
being the dominant source of new infections in the early period of the 
epidemic and (b) possible under-ascertainment in deaths, particularly 
before testing became widespread. We perform sensitivity analyses 
around these parameters (Supplementary Discussions 10, 11). 

We estimate large reductions in R_t in response to the combined
non-pharmaceutical interventions. Our results—which are driven 
more by countries with advanced epidemics and larger numbers of 
deaths—suggest that these interventions have together had a substan- 
tial effect on transmission, as measured by changes in the estimated 
R_t. At the time of this study, we find current estimates of R_t to range
from a posterior mean of 0.44 (0.26-0.61) for Norway to a posterior
mean of 0.82 (0.73-0.93) for Belgium, with an average of 0.66 across
the 11 countries, an 82% reduction compared to the pre-intervention
values. For all countries, we find that current interventions have been
sufficient to drive R_t below 1 (probability R_t < 1.0 is greater than 99%
across all countries we consider) and achieve control of the epidemic. 
These conclusions are corroborated by studies from individual countries
(France, Spain, Germany and the UK) over a similar period,
which arrive at very similar estimates despite different methodologies
and data. For example, a previous study estimates an R_t of 0.67 for
France using hospitalization records (we estimate 0.68); for Germany,
the Robert Koch Institute reports R_t of 0.76 using electronically notified
cases (we estimate 0.71). The retrospective stability of our model
(Supplementary Videos 1-3) is variable when the implementations of 
interventions are very dissimilar; an example of this is seen in Sweden, 
where interventions were dissimilar to other countries and led initially 
to large uncertainty. Our model uncertainty is also dependent on the 
magnitude of R_t; this occurs because infections are a nonlinear function
of R_t and are sensitive to small increases. Uncertainty shrinks greatly
when R_t is reduced. Examples of this effect are seen in all countries, but
it is most pronounced in Belgium and France; these countries show
large uncertainties in the number of infections in the early period of
the epidemic. Our choice of parameterizing R_t using piecewise constant
functions means that we cannot capture the fine-scale variation that 
could be achieved by using additional covariates. 

Lockdown has an identifiable large effect on transmission (81% 
(75-87%) reduction) (Fig. 2). The close spacing of interventions in 
time (Extended Data Fig. 4) means that the individual effects of the 
other interventions are not identifiable (Fig. 2). Our partial pooling 
model requires only one country to provide a signal for the effect of 
a given intervention, and this effect is then shared across all coun- 
tries. Although this sharing can potentially lead to initial over- or 
under-estimation of the effect of an intervention, it also means that a 
Fig. 1 | Country-level estimates of infections, deaths and R_t for France, Italy,
Spain and the UK. Left, daily number of infections. Brown bars are reported
infections; blue bands are predicted infections; dark blue, 50% credible
interval; light blue, 95% credible interval. The number of daily infections
estimated by our model drops immediately after an intervention, as we assume
that all infected people become immediately less infectious through the
intervention. Afterwards, if R_t is above 1, the number of infections will start
growing again. Middle, daily number of deaths. Brown bars are reported
deaths; blue bands are predicted deaths; credible intervals are as in the left
plot. Right, R_t. Dark green, 50% credible interval; light green, 95% credible
interval. Icons are interventions, shown at the time at which they occurred.
Table 1 | Total population infected by country

Country        Percentage of total population infected (mean (95% credible interval))
Austria        0.76% (0.59-0.98%)
Belgium        8% (6.1-11%)
Denmark        1.0% (0.81-1.4%)
France         3.4% (2.7-4.3%)
Germany        0.85% (0.66-1.1%)
Italy          4.6% (3.6-5.8%)
Norway         0.46% (0.34-0.61%)
Spain          5.5% (4.4-7.0%)
Sweden         3.7% (2.8-5.1%)
Switzerland    1.9% (1.5-2.4%)
UK             5.1% (4.0-6.5%)

Posterior model estimates of the attack rate by country (percentage of total population
infected) as of 4 May 2020. Results are derived from a model representing 11 countries with
a total population of 375 million and 128,928 reported COVID-19-related deaths up to 4 May
2020.


consistent signal for all countries can be estimated before that signal 
is present in data from an individual country. Therefore, this sharing
is potentially useful for generating early warnings, by leveraging what 
happened in countries with earlier epidemics to inform countries with 
more-recent epidemics. 


Estimated effect of interventions on deaths 


Extended Data Table 1 shows total deaths forecast from the beginning 
of the epidemic up to and including 4 May 2020 under our fitted model 
and under the counterfactual model, which predicts what would have 
happened if no interventions were implemented (and R_t = R_0; that is,
the initial R_t estimated before interventions came into effect).

By comparing the deaths predicted under the model with no inter- 
ventions to the deaths predicted in our intervention model, we calcu- 
lated the total deaths averted in our study period. We find that across 
11 countries 3.1 (2.8-3.5) million deaths have been averted owing to 
interventions since the beginning of the epidemic; Extended Data Fig. 5 
compares the actual total deaths to the counterfactual total deaths. The 
counterfactual model without interventions is illustrative only, and 
reflects the assumptions of our model. We do not account for changes 
in behaviour; in reality, even in the absence of government interventions
we would expect R_t to decrease and therefore would overestimate
deaths in the no-intervention model. Conversely, we do not consider the
effect on the infection fatality rate as a result of an overwhelmed health 
system in which patients may not be able to access critical care facilities, 
which would underestimate the number of counterfactual deaths. In 
the Supplementary Information, we show further counterfactual esti- 
mates under differing assumptions of the generation distribution and 
onset-to-death distribution and all scenarios broadly show the same 
trends. Given this agreement across differing scenarios, we believe 
our estimates for the counterfactual deaths averted to be plausible. 


Discussion 
During the ongoing transmission of SARS-CoV-2 in Europe, we analyse 
trends in the numbers of deaths to assess the extent to which transmission
has been reduced. Representing the infection process associated with
COVID-19 using a semi-mechanistic, joint Bayesian hierarchical model, we
can reproduce trends observed in the data relating to deaths and produce
empirically driven predictions that are valid over short time horizons.
We estimate that there have been many more infections than are 
currently reported. The high level of under-ascertainment of infections 
Fig. 2 | Effectiveness of interventions on R_t. Our model includes five
covariates for governmental interventions, adjusting for whether the
intervention was the first one undertaken by the government in response to
COVID-19 (red) or was subsequent to other interventions (green). Mean relative
percentage reduction in R_t is shown with 95% posterior credible intervals. If
100% reduction is achieved, R_t = 0 and there is no more transmission of
COVID-19. Lockdown is significantly different from the other interventions; the
other interventions are not significantly different from each other, probably
owing to the fact that many interventions occurred on the same day or within
days of each other (as shown in Extended Data Fig. 4). Results are derived from a
model that represents 11 countries with a total population of 375 million and
128,928 reported COVID-19-related deaths up to 4 May 2020.


that we estimate here is probably due to the focus on testing in hospital 
settings, which misses milder or asymptomatic cases in the commu- 
nity. Despite this, we estimate that only a relatively small minority of 
individuals in each country have been infected (Table 1). Our estimates 
imply that the populations in Europe are not close to herd immunity 
(about 70% if R_0 is 3.8). Furthermore, with values of R_t below 1 in all
countries, the rate of acquisition of herd immunity will slow down rapidly.
Our estimates for attack rates during our study period are consistent
with those reported from national serological studies. Similarly,
comparable studies estimating R_t all agree that the number as of 4 May
2020 is less than 1. 
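
The herd immunity figure quoted above can be related to the starting reproduction number with the standard textbook approximation 1 - 1/R_0, which assumes a homogeneously mixing population. That formula is not given in this section and is used here only as an illustrative assumption; the function name is hypothetical.

```python
def herd_immunity_threshold(r0):
    """Fraction of the population that must be immune for the effective
    reproduction number to fall below 1, under homogeneous mixing.

    Uses the textbook approximation 1 - 1/R_0 (an assumption here,
    not a formula taken from the text).
    """
    if r0 <= 1.0:
        return 0.0  # an epidemic with R_0 <= 1 does not grow
    return 1.0 - 1.0 / r0

# Starting reproduction number reported in the text (averaged estimate).
threshold = herd_immunity_threshold(3.8)
```

With a starting value of 3.8 the approximation gives a threshold a little above 70%, far higher than any of the attack rates in Table 1.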

To our knowledge, our modelling approach is unique in pooling infor- 
mation from multiple countries at once. Using this approach means 
that we require a central consolidated data source (such as data from 
the European Centre of Disease Control (ECDC)), and also that trends 
in some countries will be affected by those countries with more data. 
We argue that this effect is beneficial, in that it helps to minimize
idiosyncrasies in the data, as well as to improve consistency of estimates
over time. Although our qualitative conclusions surrounding the effect
of interventions and the finding that R_t is less than 1 are robust to our
choice of whether to incorporate pooling or not, the ability to use a
greater extent of available data and share information across countries
in a statistically principled manner markedly improves the consistency
of model predictions across the study period (Supplementary
Videos 1-3). 

Most interventions were implemented in rapid succession in many 
countries, and as such it is difficult to disentangle the individual 
effect sizes of each intervention. In our analysis, we find that only the 
effect of lockdown is identifiable, and that it has a substantial effect 
(81% (75-87%) reduction in R_t). Taking into account country-specific
effects, the effect size of lockdown remains large across all countries 
(Supplementary Fig. 29). 

We acknowledge the limitations of existing mortality data relat- 
ing to COVID-19—in particular, deaths outside hospitals may be 
underreported. However, by using the ECDC data, we rely on a
comprehensive data source that is refined and updated each day 
in a systematic process. Our sensitivity analysis of underreporting 
and statistical-measurement noise suggests that we may slightly
underestimate the attack rates in some countries, but this does not
change our overall conclusions pertaining to R_t. However, even if
the data were complete, our method cannot surmount the time lag 
between infections and deaths and can only fully identify trends in 
infections 2-3 weeks earlier. Extensions of our model could use case, 
hospitalization or intensive care data, but reconciling the different 
biases inherent in these sources while ensuring parsimony is chal- 
lenging and would require additional assumptions. 

The modern understanding of infectious disease, combined with 
a global publicized response, has meant that nationwide interventions
could be implemented with widespread adherence and support.
Given the observed infection fatality ratios and the epidemiology of 
COVID-19, major non-pharmaceutical interventions have had an effect 
in reducing transmission in all of the countries we have considered. 
In all countries in this study, we find that these interventions have 
reduced R_t below 1, and have contained their epidemics at the current
time. When looking at simplistic counterfactual models over the
whole epidemic, the number of potential deaths averted is substantial. 
We cannot say for certain that the current measures will continue to 
control the epidemic in Europe; however, if current trends continue 
there is reason for optimism. 
Methods


Data 

Our model uses daily consolidated death data from the ECDC for 
11 European countries currently experiencing the COVID-19 epidemic: 
Austria, Belgium, Denmark, France, Germany, Italy, Norway, Spain, 
Sweden, Switzerland and the UK. The ECDC provides information on 
confirmed cases and deaths attributable to COVID-19. For population 
counts, we use the United Nations Population Division age-stratified 
counts.

We also catalogue data on the nature and type of major 
non-pharmaceutical interventions. We looked at the government webpages
from each country as well as their official public health webpages
to identify the latest advice or laws being issued by the government and 
public health authorities. We collected the following: school closure 
ordered; case-based measures; public events banned; social distanc- 
ing encouraged; lockdown decreed; and the time of the first and last 
intervention. A full list of the timing of these interventions and the 
sources we have used is provided in the Supplementary Notes, Sup- 
plementary Table 2. 

By using the ECDC data, we rely on a consolidated data source 
compiled by the ECDC, who include many sources of data each day, 
constantly refining and updating data using a comprehensive and 
systematic process. However, despite the rigorous protocols, countries 
may vary in the specifics of the data that they report to the ECDC. For 
example, there is variation in reporting (that is, community versus hos- 
pital) and time lags. Despite these issues, we use ECDC data to ensure 
as much consistency as possible across all countries. 


Model 
A visual summary of our model is presented in Extended Data Fig. 3; 
details are provided in the Supplementary Methods. 

We fit our model to observed deaths according to ECDC data from 
11 European countries. The modelled deaths are informed by an 
infection-to-death distribution (Supplementary Fig. 1; derived from 
assumptions about the time from infection to the onset of symptoms 
and about the time from the onset of symptoms to death), and the 
population-averaged infection fatality ratio (adjusted for the age struc- 
ture and contact patterns of each country, as discussed in the Supple- 
mentary Methods, Supplementary Table 3). 

Given these distributions and ratios, modelled deaths are a function
of the number of infections. The number of infections is modelled as
the product of R_t with a discrete convolution of the previous infections.
Individual components of this convolution sum are weighted by the 
generation time distribution (the average time from the infection of 
one person to the time at which they infect another; Supplementary 
Fig. 2). In our work, we approximate the generation time distribution 
using the serial interval distribution. R_t is a function of the initial R_t
before interventions and the effect sizes from interventions, in which 
interventions are modelled as piecewise constant functions. 
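
The structure described above can be sketched in a few lines of Python. This is an illustrative sketch only, not the authors' implementation (which is available from the repository listed under Code availability). The function names are hypothetical, and the multiplicative form exp(-effect) for intervention effects is an assumption made here for illustration; the text above states only that R_t is a piecewise constant function of the initial R_t and the intervention effect sizes.

```python
import numpy as np

def piecewise_r_t(r0, effect_sizes, start_days, n_days):
    """Piecewise-constant R_t: each intervention scales R_t by exp(-effect)
    from its start day onward (assumed functional form, for illustration)."""
    r_t = np.full(n_days, float(r0))
    for effect, start in zip(effect_sizes, start_days):
        r_t[start:] *= np.exp(-effect)
    return r_t

def simulate_infections(r_t, generation_pmf, seed_infections):
    """Discrete renewal process: infections on day t equal R_t times the sum of
    earlier infections weighted by the generation-time distribution.
    generation_pmf[k - 1] is the probability of a generation time of k days."""
    n_days = len(r_t)
    n_seed = len(seed_infections)
    infections = np.zeros(n_days)
    infections[:n_seed] = seed_infections
    for t in range(n_seed, n_days):
        max_lag = min(t, len(generation_pmf))
        past = infections[t - max_lag:t][::-1]  # infections at lags 1..max_lag
        infections[t] = r_t[t] * np.dot(past, generation_pmf[:max_lag])
    return infections

def expected_deaths(infections, infection_to_death_pmf, ifr):
    """Expected daily deaths: the infection fatality ratio times infections
    convolved with the infection-to-death delay distribution.
    infection_to_death_pmf[k] is the probability of death k days after infection."""
    conv = np.convolve(infections, infection_to_death_pmf)[:len(infections)]
    return ifr * conv
```

In this sketch the link from interventions to deaths runs in one direction only: interventions change R_t, R_t drives infections through the renewal sum, and deaths follow infections with a delay. The full model additionally treats these quantities as random variables within a Bayesian hierarchy, which the sketch does not attempt to reproduce.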

Following the Bayesian hierarchy from bottom to top gives us a full 
framework to see how interventions affect infections, which can result
in deaths. A schematic of our model is shown in Extended Data Fig. 3. To 
maximize the ability to observe the effect of interventions on deaths, 
we fit our model jointly for all 11 European countries, and use partial 
pooling of information between countries with both individual and 
shared effects on R_t. Partial pooling operates on the last intervention,
which is—in most cases—lockdown. The effect of partial pooling can 
be seen in Supplementary Discussion 12, Supplementary Fig. 29. We 
chose a balanced prior that encodes the prior belief that interventions 
have an equal chance of having an effect or not, and ensure a uniform 
prior on the joint effect of all interventions (Supplementary Fig. 3). 
We evaluate the effect of our Bayesian prior distribution choices and 
evaluate our Bayesian posterior calibration to ensure our results are 
statistically robust. 


We perform extensive model validation and sensitivity analyses. We 
validate our model by cross-validation over a 14-day period (Supple- 
mentary Discussion 1, Supplementary Table 1) and we show the fits for 
holdout samples in Supplementary Figs. 5-15. We check the convergence 
of the Markov chain Monte Carlo sampler (Supplementary Fig. 4). We 
consider the sensitivity of our estimates of R_t to the mean of the generation
distribution (Supplementary Discussion 3, Supplementary Figs. 16,
17). We further show that the choice of generation distribution does not 
change our counterfactual conclusions (Supplementary Fig. 18). Using 
univariate analyses and uninformative priors, we find (Supplementary 
Fig. 19) that all effects on their own serve to decrease R_t (Supplementary
Discussion 4). We compare our model to a non-parametric Gaussian 
Process model (Supplementary Discussion 5). To assess the effect of 
individual countries on the results, we perform a ‘leave one country 
out’ sensitivity analysis (Supplementary Discussion 6, Supplementary 
Figs. 20, 21). To validate our starting values of R_t, we compare our model
against an exponential-growth linear model (Supplementary Discus- 
sion 7, Supplementary Fig. 22). Instead of a joint analysis, we consider 
fits of our model to individual countries (Supplementary Discussion 8, 
Supplementary Figs. 23-26). We perform a sensitivity analysis with 
respect to the onset-to-death distribution (Supplementary Discussion 9, 
Supplementary Fig. 27). We validate our probabilistic seeding scheme 
through an importance-sampling leave-one-out cross-validation (Sup- 
plementary Discussion 10). We consider a model extension with a con- 
stant, probabilistic under-reporting (Supplementary Discussion 11), 
finding that R_t does not change substantially (Supplementary Fig. 28).

Our model is different to other approaches (such as EpiEstim) that
use the discrete renewal equation. We use the renewal equation as a 
latent process to model infections and propose a generative mecha- 
nism to connect these infections to death data. Simply applying the 
renewal equation directly to death data requires positing a mechanism
in which deaths in the past can cause future deaths (see, for example, 
ref.”°). In addition, for R_t, we are able to use a functional relationship in
which non-pharmaceutical interventions can have a direct effect on R_t.


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


Death counts for the 11 European countries for the time period in our 
study and the full set of posterior draws from our model are available 
at https://reshare.ukdataservice.ac.uk/854380/. 


Code availability 


All source code and data necessary for the replication of our results 
and figures are available at https://github.com/ImperialCollegeLondon/covid19model.
An R package based on our method is available at
https://imperialcollegelondon.github.io/epidemia/. 
Extended Data Fig. 1 | Country-level estimates of infections, deaths and
R_t for Belgium, Germany, Sweden and Switzerland. Left, daily number of
infections. Brown bars are reported infections; blue bands are predicted 
infections; dark blue, 50% credible interval; light blue, 95% credible interval. 
The number of daily infections estimated by our model drops immediately 
after an intervention, as we assume that all infected people become 
immediately less infectious through the intervention. Afterwards, if R_t is above
1, the number of infections will start growing again. Middle, daily number of
deaths. Brown bars are reported deaths; blue bands are predicted deaths; 
credible intervals are as in the left plot. Right, R_t. Dark green, 50% credible
interval; light green, 95% credible interval. Icons are interventions, shown at 
the time at which they occurred. 
Extended Data Fig. 2 | Country-level estimates of infections, deaths and
R_t for Austria, Norway and Denmark. Left, daily number of infections. Brown
bars are reported infections; blue bands are predicted infections; dark blue, 
50% credible interval; light blue, 95% credible interval. The number of daily 
infections estimated by our model drops immediately after an intervention, as 
we assume that all infected people become immediately less infectious 
through the intervention. Afterwards, if R_t is above 1, the number of infections
will start growing again. Middle, daily number of deaths. Brown bars are
reported deaths; blue bands are predicted deaths; credible intervals are as in
the left plot. Right, R_t. Dark green, 50% CI; light green, 95% CI. Icons are
interventions, shown at the time at which they occurred.
Extended Data Fig. 5 | Deaths averted owing to interventions. Lower and upper credible interval for the cumulative number of deaths for the 11 European
countries in our model with interventions (pink) and the no-interventions counterfactual model (blue). Reported deaths are shown as a thick black line.


Extended Data Table 1 | Total forecasted deaths since the beginning of the epidemic up to 4 May 2020 in our model and in a
counterfactual model that assumes no interventions had taken place 


Country | Deaths up to 4th May (observed) | Model estimated deaths up to 4th May (our model) | Model estimated deaths up to 4th May (counterfactual model assuming no interventions have occurred) | Model deaths averted up to 4th May (difference between counterfactual and actual)
--- | --- | --- | --- | ---
Austria | 600 | 620 [520 - 720] | 66,000 [40,000 - 86,000] | 65,000 [40,000 - 85,000]
Belgium | 7,924 | 7,300 [6,400 - 8,400] | 120,000 [93,000 - 140,000] | 110,000 [86,000 - 130,000]
Denmark | 493 | 500 [430 - 590] | 34,000 [17,000 - 50,000] | 34,000 [17,000 - 49,000]
France | 25,201 | 23,000 [21,000 - 27,000] | 720,000 [590,000 - 850,000] | 690,000 [570,000 - 820,000]
Germany | 6,831 | 6,800 [6,000 - 7,900] | 570,000 [370,000 - 780,000] | 560,000 [370,000 - 770,000]
Italy | 29,079 | 31,000 [27,000 - 35,000] | 670,000 [540,000 - 800,000] | 630,000 [510,000 - 760,000]
Norway | 208 | 210 [170 - 250] | 12,000 [3,400 - 24,000] | 12,000 [3,200 - 23,000]
Spain | 25,613 | 25,000 [22,000 - 28,000] | 470,000 [390,000 - 560,000] | 450,000 [360,000 - 540,000]
Sweden | 2,769 | 2,800 [2,500 - 3,300] | 28,000 [15,000 - 49,000] | 26,000 [12,000 - 46,000]
Switzerland | 1,476 | 1,500 [1,300 - 1,800] | 54,000 [36,000 - 73,000] | 52,000 [34,000 - 71,000]
United Kingdom | 28,734 | 29,000 [25,000 - 34,000] | 500,000 [400,000 - 610,000] | 470,000 [370,000 - 580,000]
All | 128,928 | 130,000 [120,000 - 140,000] | 3,200,000 [2,900,000 - 3,600,000] | 3,100,000 [2,800,000 - 3,500,000]

Estimated averted deaths over this time period as a result of the interventions. Numbers in brackets are 95% credible intervals. 
Governments around the world are responding to the coronavirus disease 2019
(COVID-19) pandemic, caused by severe acute respiratory syndrome coronavirus 2
(SARS-CoV-2), with unprecedented policies designed to slow the growth rate of 
infections. Many policies, such as closing schools and restricting populations to their 
homes, impose large and visible costs on society; however, their benefits cannot be 
directly observed and are currently understood only through process-based 
simulations. Here we compile data on 1,700 local, regional and national
non-pharmaceutical interventions that were deployed in the ongoing pandemic 
across localities in China, South Korea, Italy, Iran, France and the United States. We 
then apply reduced-form econometric methods, commonly used to measure the 
effect of policies on economic growth, to empirically evaluate the effect that these
anti-contagion policies have had on the growth rate of infections. In the absence of 
policy actions, we estimate that early infections of COVID-19 exhibit exponential 
growth rates of approximately 38% per day. We find that anti-contagion policies have 
significantly and substantially slowed this growth. Some policies have different effects 
on different populations, but we obtain consistent evidence that the policy packages
that were deployed to reduce the rate of transmission achieved large, beneficial and
measurable health outcomes. We estimate that across these 6 countries, 
interventions prevented or delayed on the order of 61 million confirmed cases, 
corresponding to averting approximately 495 million total infections. These findings 
may help to inform decisions regarding whether or when these policies should be 
deployed, intensified or lifted, and they can support policy-making in the more than 
180 other countries in which COVID-19 has been reported.


The COVID-19 pandemic is forcing societies worldwide to make 
consequential policy decisions with limited information. After con- 
tainment of the initial outbreak failed, attention turned to imple- 
menting non-pharmaceutical interventions that are designed to slow 
the contagion of the virus. In general, these policies aim to decrease 
virus transmission by reducing contact among individuals within or 
between populations, such as by closing restaurants or restricting 
travel, thereby slowing the spread of COVID-19 to a manageable rate. 
These large-scale anti-contagion policies are informed by epidemiological
simulations and a small number of natural experiments
during past epidemics. However, the actual effects of these policies
on infection rates in the ongoing pandemic are unknown. Because 
the modern world has never confronted this pathogen, nor deployed 
anti-contagion policies of such scale and scope, it is crucial that direct
measurements of the effects of policies are used together with numeri- 
cal simulations in current decision-making. 

Societies around the world are considering whether the health benefits
of anti-contagion policies are worth their social and economic
costs. Many of these costs are clearly observed; for example, business
restrictions increase unemployment and school closures affect edu- 
cational outcomes. It is therefore not surprising that some popula- 
tions have hesitated before implementing such policies, especially 
when their costs are visible while their health benefits—infections and 
deaths that would have occurred but are instead avoided or delayed— 
are unseen. Our objective is to measure the direct health benefits of 
these policies; specifically, how much these policies slowed the growth 
rate of infections. To do this, we compare the growth rate of infec- 
tions within hundreds of subnational regions before and after each of 
these policies is implemented locally. Intuitively, each administrative 
unit observed immediately before a policy deployment serves as the 
‘control’ for the same unit in the days after it receives a policy ‘treatment’
(see Supplementary Information for accounts of these deployments).
Our hope is to learn from the recent experience of six countries
in which the early spread of the virus triggered large-scale policy 
actions, in part so that societies and decision-makers everywhere 
can access this information. 
Berkeley, Berkeley, CA, USA. Electrical Engineering & Computer Science Department, UC Berkeley, Berkeley, CA, USA. e-mail: shsiang@berkeley.edu
Fig. 1 | Data on COVID-19 infections and large-scale anti-contagion policies.
Left, daily cumulative confirmed cases of COVID-19 (solid black line, left axis) 
and deaths (dashed black line) over time. Vertical lines are deployments of 
anti-contagion policies, for which the height indicates the number of 
administrative units that instituted a policy that day (right axis). For display 
purposes only, <5 policy types are shown per country and missing case data are 
imputed unless all subnational units are missing. Right, maps of cumulative 
confirmed cases by administrative unit on the last date of each sample. 


Here we directly estimate the effects of 1,700 local, regional and 
national policies on the growth rate of infections across localities 
within China, South Korea, Italy, Iran, France and the United States 
(Fig. 1 and Supplementary Table 1). We compile subnational data on
daily infection rates, changes in case definitions and the timing of policy 
deployments, including (1) travel restrictions, (2) social distancing 
through the cancellations of events and suspensions of educational, 
commercial and religious activities, (3) quarantines and lockdowns, 
and (4) additional policies such as emergency declarations and expansions
of paid sick leave, from the earliest available dates to 6 April 2020
(Extended Data Fig. 1 and Supplementary Notes). During this period,
populations remained almost entirely susceptible to COVID-19, causing 
the natural spread of infections to exhibit almost perfect exponential 
growth. The rate of this exponential growth could change daily,
determined by epidemiological factors, such as disease infectivity, as
well as policies that alter behaviour. Because policies were deployed
while the epidemic unfolded, we can estimate their effects empirically. 
We examine how the daily growth rate of infections in each locality 
changed in response to the collection of ongoing policies applied to 
that locality on that day. 

We use well-established reduced-form econometric techniques
that are commonly used to measure the effects of events on economic
growth rates. Similar to early COVID-19 infections, economic
output generally increases exponentially with a variable rate that can be 
affected by policies and other conditions. Here, this technique aims to 
measure the total magnitude of the effect of changes in policy, without 
requiring explicit prior information about fundamental epidemiological
parameters or mechanisms, many of which remain uncertain in the
current pandemic. Instead, the collective influence of these factors is 
empirically recovered from the data without modelling their individual 
effects explicitly (see Methods). Previous research on influenza, for
example, has shown that such statistical approaches can provide impor- 
tant complementary information to process-based models. 

To construct the dependent variable, we transform location-specific,
subnational time-series data on infections into first differences of
their natural logarithm, which is the per-day growth rate of infections
(see Methods). We use data from first- or second-level administrative 
units and data on active or cumulative cases, depending on availability 
(Supplementary Notes). We employ widely used panel regression models
to estimate how the daily growth rate of infections changes over
time within a location when different combinations of large-scale poli- 
cies are enacted (see Methods). Our econometric approach accounts 
for differences in the baseline growth rate of infections across subnational
locations, which may be affected by time-invariant characteristics,
such as demographics, socioeconomic status, culture and health
systems; it accounts for systematic patterns in growth rates within 
countries unrelated to policy, such as the effect of the workweek; it is 
robust to systematic undersurveillance specific to each subnational 
unit; and it accounts for changes in procedures to diagnose positive 
cases (Methods and Supplementary Methods). 
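
The core of this approach can be illustrated with a minimal Python sketch. It is a simplified stand-in rather than the specification used in the paper: it includes only a single policy indicator and one intercept per administrative unit, and it omits the day-of-week effects and the adjustments for changes in case definitions mentioned above. The function names are hypothetical.

```python
import numpy as np

def log_growth_rates(cases):
    """First differences of the natural log of case counts, which is the
    per-day growth rate. Case counts must be strictly positive."""
    cases = np.asarray(cases, dtype=float)
    return np.diff(np.log(cases))

def within_unit_policy_effect(growth, policy, unit_ids):
    """OLS estimate of the change in growth rate associated with a policy
    indicator, allowing a separate baseline growth rate for each unit
    (unit fixed effects, implemented by demeaning within each unit)."""
    growth = np.asarray(growth, dtype=float)
    policy = np.asarray(policy, dtype=float)
    unit_ids = np.asarray(unit_ids)
    g_dm = growth.copy()
    p_dm = policy.copy()
    for u in np.unique(unit_ids):
        mask = unit_ids == u
        g_dm[mask] -= growth[mask].mean()
        p_dm[mask] -= policy[mask].mean()
    # Slope of demeaned growth on demeaned policy.
    return float(np.dot(p_dm, g_dm) / np.dot(p_dm, p_dm))
```

Because each unit is compared only with itself before and after the policy, differences in baseline growth between units do not contaminate the estimate, which is the 'control' and 'treatment' logic described earlier.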

We estimate that in the absence of policies, early infection rates 
of COVID-19 grow 43% per day on average across these six countries 
(s.e.m. = 5%), implying a doubling time of approximately 2 days. 
Country-specific estimates range from 34% per day in the United States
(s.e.m. = 7%) to 68% per day in Iran (s.e.m. = 9%). We cannot determine
whether the high estimate for Iran results from true epidemiologi- 
cal differences, data-quality issues (see Methods), the concurrence 
of the initial outbreak with a major religious holiday and pilgrimage 
(Supplementary Notes) or sampling variability. Excluding Iran, the 
average growth rate is 38% per day (s.e.m. = 5%). Growth rates in all five
other countries are independently estimated to be very near this value 
(Fig. 2a). These estimated values differ from observed average growth 
rates because the latter are confounded by the effects of policies. These 
growth rates are not driven by the expansion of testing or increasing 
rates of case detection (Methods and Extended Data Fig. 2) nor by data 
from individual regions (Extended Data Fig. 3). 
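
As a rough check on the doubling time quoted above, the conversion from a growth rate to a doubling time can be sketched as follows. The text does not state here whether '43% per day' is a continuous-time rate (the first difference of log cases) or a day-over-day percentage, so both readings are shown as assumptions; the function names are hypothetical.

```python
import math

def doubling_time_continuous(rate):
    """Doubling time in days when `rate` is a continuous-time growth rate,
    that is, cases grow as exp(rate * t)."""
    return math.log(2) / rate

def doubling_time_daily(pct_per_day):
    """Doubling time in days when cases grow by `pct_per_day` percent each day."""
    return math.log(2) / math.log(1 + pct_per_day / 100)

def continuous_to_daily_pct(rate):
    """Day-over-day percentage growth implied by a continuous-time rate."""
    return (math.exp(rate) - 1) * 100

print(doubling_time_continuous(0.43))  # reading 43% as a continuous rate
print(doubling_time_daily(43))         # reading 43% as day-over-day growth
```

Under either reading the result lies between roughly 1.6 and 1.9 days, consistent with the approximate 2-day doubling time reported.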

Some previous analyses of pre-intervention infections in Wuhan 
have suggested that the growth rates were slower (doubling every 
5-7 days) using data collected before national standards for diagnosis
and case definitions were first issued by the Chinese government
on 15 January 2020. However, case data in Wuhan from before
this date contain multiple irregularities: the cumulative case count
decreased on 9 January 2020; no new cases were reported for 9-15 January;
and there were concerns that information about the outbreak was
suppressed (Supplementary Table 2). When we remove these data,
Fig. 2 | Empirical estimates of unmitigated COVID-19 infection growth rates
and the effect of anti-contagion policies. Markers are country-specific 
estimates, whiskers show the 95% confidence interval. Columns report effect 
sizes as a change in the continuous-time growth rate (95% confidence intervals 
are shown in parentheses) and the day-over-day percentage growth rate. a, 
Estimates of daily COVID-19 infection growth rates in the absence of policy 
(dashed lines, averages with and without Iran, both excluding the Wuhan-specific 
estimate). b, Estimated combined effect of all policies on infection growth rates. 
c, Estimated effects of individual policies or policy groups on the daily growth 
rate of infections, jointly estimated and ordered roughly chronologically within 
each country. The asterisks indicate that the reported effect of ‘home isolation’ 
includes effects of other implied policies (see Methods). WFH denotes work from 
home policy. China, n = 3,669; South Korea, n = 595; Italy, n = 2,898; Iran, n = 548;
France, n = 270; United States, n = 1,238.


using a shorter but more reliable pre-intervention time series from 
Wuhan (16-21 January), we recover a growth rate of 43% per day 
(s.e.m. = 3%), which corresponds to a doubling time of 2 days, consistent
with results from all other countries except Iran (Fig. 2a).

During the early stages of an epidemic, a large proportion of the 
population remains susceptible to the virus, and if the spread of the 
virus is left uninhibited by changes in policies or behaviour, exponential 
growth continues until the fraction of the susceptible population
decreases meaningfully. After correcting for estimated rates
of case detection, we compute that the minimum susceptible fraction
across administrative units in our sample is 72% of the total population
(Cremona, Italy) and 87% of administrative units would be likely to be
in a regime of uninhibited exponential growth (that is, more than 95%
of the population remains susceptible) if policies were removed on 
the last date of our sample. 

Consistent with predictions from epidemiological models, we
find that the combined effect of policies within each country reduces
the growth rate of infections by a substantial and statistically significant
amount (Fig. 2b and Supplementary Table 3). For example, a locality 
in France with a baseline growth rate of 0.33 (national average) that 
fully deployed all policy actions used in France would be expected 
to lower its daily growth rate by 0.17 to a growth rate of 0.16. In general,
the estimated total effects of policy packages are large enough
that they can in principle offset a large fraction of, or even eliminate, 
the baseline growth rate of infections—although in several countries, 
many localities have not deployed the full set of policies. Overall, the 
estimated effects of all policies combined are generally insensitive to 
withholding regional (that is, state- or province-level) blocks of data 
from the sample (Extended Data Fig. 3). 

In China, only three policies were enacted across 115 cities early in a
7-week period, providing us with sufficient data to empirically estimate 
how the effects of these policies evolved over time without making 
assumptions about the timing of these effects (Fig. 2b and Methods). 
We estimate that the combined effect of these policies reduced the 
growth rate of infections by -0.026 (s.e.m. = 0.046) in the first week
after they came into effect, increasing substantially in the second week
to -0.20 (s.e.m. = 0.049), and essentially stabilizing in the third week
around -0.28 (s.e.m. = 0.047). In other countries, we lack sufficient
data to estimate these temporal dynamics explicitly and only report the 
average pooled effect of policies across all days after their deployment 
(Methods). If other countries have transient responses similar to China, 
we would expect that the effects in the first week after deployment are 
smaller in magnitude than the average effect that we report. We also 
explore how our estimates would change if we impose the assumption 
that policies cannot affect infection growth rates until after a fixed num- 
ber of days (Extended Data Fig. 5a and Supplementary Methods sec- 
tion 3); however, we do not find evidence that this improves model fit. 

The estimates described above (Fig. 2b) capture the superposition 
of all policies deployed in each country; that is, they represent the 
average effect of policies that we would expect to observe if all policies 
enacted anywhere in each country were implemented simultaneously 
in a single region of that country. We also estimate the effects of individual
policies or clusters of policies (Fig. 2c) that are grouped based
on either their similarity in goal (for example, library and museum 
closures) or timing (for example, policies deployed simultaneously). 
Our estimates for these individual effects tend to be statistically noisier 
than the estimates for all policies combined. Some estimates for the 
same policy differ between countries, perhaps because policies are not 
implemented identically or because populations behave differently. 
Nonetheless, 22 out of 29 point estimates indicate that individual poli- 
cies are probably contributing to the reduction of the growth rate of 
infections. Seven policies (one in South Korea, two in Italy and four in 
the United States) have point estimates that are positive, six of which 
are small in magnitude (less than 0.1) and not statistically different 
from zero (5% level). Consistent with greater overall uncertainty in 
these disaggregated estimates, some of the estimates in China, South 
Korea, Italy and France are moderately more sensitive to withholding 
regional blocks of data (Extended Data Fig. 4), but remain broadly 
robust to the assumption of a constant delayed effect of all policies 
(Extended Data Fig. 5b). 

On the basis of these results, we find that the deployment of 
anti-contagion policies in all six countries significantly slowed the 
Fig. 3 | Estimated infection growth rates based on actual anti-contagion
policies and in a no-policy counterfactual scenario. Predicted daily growth
rates of active (China and South Korea) or cumulative (all others) COVID-19
infections based on the observed timing of all policy deployments within each
country (blue) and in a scenario in which no policies were deployed (red). The
difference between these two predictions is our estimated effect of actual
anti-contagion policies on the growth rate of infections. Small markers are
daily estimates for subnational administrative units (vertical lines are 95%
confidence intervals). Large markers are national averages. Black circles are
observed daily changes in log(infections), averaged across administrative
units. Sample sizes are the same as in Fig. 2.


pandemic. We combine the estimates above with our data on the tim- 
ing of the 1,700 policy deployments to estimate the total effect of all 
policies across the dates in our sample. To do this, we use our estimates 
to predict the growth rate of infections in each locality on each day, 
given the actual policies in effect at that location on that date (Fig. 3). 
We then use the same model to predict what the counterfactual growth 
rates would be on that date if the effects of all policies were removed 
(Fig. 3), which we call the no-policy scenario. The difference between 
these two predictions is our estimate of the effect that all deployed 
policies had on the growth rate of infections. During our sample, we
estimate that all policies combined slowed the average growth rate of
infections by -0.252 per day (s.e.m. = 0.045, P < 0.001) in China, -0.248
(s.e.m. = 0.089, P < 0.01) in South Korea, -0.24 (s.e.m. = 0.068, P < 0.001)
in Italy, -0.355 (s.e.m. = 0.063, P < 0.001) in Iran, -0.123 (s.e.m. = 0.019,
P < 0.001) in France and -0.084 (s.e.m. = 0.03, P < 0.01) in the United
States. These results are robust to modelling the effects of policies 
without grouping them (Extended Data Fig. 6a and Supplementary 
Table 4) or assuming a delayed effect of policy on infection growth 
rates (Supplementary Table 5). 

The number of COVID-19 infections on a date depends on the growth
rate of infections on all previous days. Thus, persistent reductions in 
growth rates have a compounding effect on infections, until growth 
is slowed by a shrinking susceptible population. To provide a sense of 
scale for our results, we integrate the growth rate of infections in each 
locality from Fig. 3 to estimate cumulative infections, both with actual 
anti-contagion policies and in the no-policy scenario. To account for the
declining susceptible population in each administrative unit, we couple 
our econometric estimates of the effects of policies with a susceptible-infected-removed
model that adjusts the susceptible population in
each administrative unit based on estimated case-detection rates
(see Methods). This allows us to extend our projections beyond the 
initial exponential growth phase of infections, a threshold that many 
localities cross in our no-policy scenario. 
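
The idea of integrating growth rates while allowing the susceptible population to deplete can be sketched with a simple discrete-time susceptible-infected-removed simulation. This is an illustrative sketch under stated assumptions, not the procedure given in the Methods: it assumes that the early-epidemic growth rate equals the transmission rate minus the removal rate, uses a hypothetical removal rate `gamma`, takes one Euler step per day and ignores case-detection rates. The function name is also hypothetical.

```python
import numpy as np

def simulate_cumulative_infections(growth_rates, gamma, initial_infected, population):
    """Daily Euler steps of an SIR model in which the transmission rate on each
    day is implied by that day's early-epidemic growth rate (beta = g + gamma).
    Returns cumulative infections (counts) for day 0 and each subsequent day."""
    s = 1.0 - initial_infected / population  # susceptible fraction
    i = initial_infected / population        # actively infected fraction
    cumulative = [float(initial_infected)]
    for g in growth_rates:
        beta = max(g + gamma, 0.0)           # transmission rate implied by g
        new_infections = min(beta * s * i, s)  # cannot exceed remaining susceptibles
        s -= new_infections
        i += new_infections - gamma * i
        cumulative.append((1.0 - s) * population)
    return np.array(cumulative)

# Hypothetical example: growth of 0.38 per day throughout (no policy) versus
# growth falling to 0.05 per day after day 20 (policy).
no_policy = simulate_cumulative_infections([0.38] * 40, 0.08, 10, 1_000_000)
with_policy = simulate_cumulative_infections([0.38] * 20 + [0.05] * 20, 0.08, 10, 1_000_000)
```

While nearly everyone is susceptible the simulated infections grow almost exponentially at the given rate; once the susceptible fraction falls, growth slows even without any policy, which is why the no-policy projection cannot simply extrapolate the exponential indefinitely.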

Our results suggest that anti-contagion policies have already substantially
reduced the number of COVID-19 infections observed in the
world at present (Fig. 4). Our central estimates suggest that there would 
be approximately 37 million more cumulative confirmed cases (corre- 
sponding to 285 million more total infections, including the confirmed 
cases by 5 March 2020) in China, 11.5 million more confirmed cases 
(38 million total infections by 6 April 2020) in South Korea, 2.1 million 
more confirmed cases (49 million total infections by 6 April 2020) in 
Italy, 4.9 million more confirmed cases (54 million total infections by 
22 March 2020) in Iran, 280,000 more confirmed cases (9 million total 
infections by 25 March 2020) in France and 4.8 million more confirmed 
cases (60 million total infections by 6 April 2020) in the United States 
had these countries never enacted any anti-contagion policies since 
the start of the pandemic. The magnitudes of these impacts partially
reflect the timing, intensity and extent of policy deployment (for 
example, how many localities deployed policies) and the duration for 
which they have been applied. Several of these estimates are subject to 
large statistical uncertainties (see intervals in Fig. 4). Sensitivity tests 
(Extended Data Fig. 7) that assume a range of plausible alternative 
parameter values relating to disease dynamics, such as incorporat- 
ing a susceptible-exposed-infected-removed model, suggest that 
interventions may have reduced the severity of the outbreak by a total
of 54-65 million confirmed cases over the dates in our sample (central
estimates). Sensitivity tests in which the assumed infection-fatality
ratio is varied (Supplementary Table 6) suggest a corresponding range
of 46-77 million confirmed cases (490-580 million total infections). 

Our empirical results indicate that large-scale anti-contagion policies 
have slowed the COVID-19 pandemic. Because infection rates in the 
countries that we studied would have initially followed rapid exponen- 
tial growth had no policies been applied, our results suggest that these 
policies have provided large health benefits. For example, we estimate 
that there would be approximately 465x the observed number of con- 
firmed cases in China, 17x the number in Italy and 14x the number in the 
United States by the end of our analysis if large-scale anti-contagion 
policies had not been deployed. Consistent with process-based simulations
of COVID-19 infections, our analysis of existing policies
indicates that seemingly small delays in policy deployment are likely 
to have produced markedly different health outcomes. 

Although the limitations of available data pose challenges to our 
analysis, our aim is to use what data exist to estimate the first-order 
effects of unprecedented policy actions in an ongoing global crisis. 
As more data become available, related findings will become more 

Fig. 4 | Estimated cumulative confirmed COVID-19 infections with and
without anti-contagion policies. The predicted cumulative number of
confirmed COVID-19 infections based on actual policy deployments (blue) and
in the no-policy counterfactual scenario (red). Shaded areas show uncertainty
based on 1,000 simulations for which empirically estimated parameters are
resampled from their joint distribution (dark, inner 70% of predictions; light,
inner 95%). Black dotted line is observed cumulative infections. Infections are
not projected for administrative units that never report infections in the
sample, but which might have experienced infections in a no-policy scenario.


precise and may capture more complex interactions. Furthermore, 
this analysis does not account for interactions between populations 
in nearby localities, nor for mobility networks. Nonetheless, we hope
that these results can support critical decision-making, both in the
countries that we study and in the more than 180 other countries in
which COVID-19 infections have been reported.

A key advantage of our reduced-form top-down statistical approach
is that it captures the real-world behaviour of affected populations 
without requiring that we explicitly model the underlying mechanisms 
and processes. This is useful in the current pandemic, for which many 
process-related parameters remain uncertain. However, our results 
cannot and should not be interpreted as a substitute for bottom-up 
process-based epidemiological models that are specifically designed 
to provide guidance in public health crises. Rather, our results com- 
plement existing models, for example, by helping to calibrate key 
model parameters. We believe both forward-looking simulations and 
backward-looking empirical evaluations should be used to inform 
decision-making. 

Our analysis measures changes in local infection growth rates
associated with changes in anti-contagion policies. A necessary condition
for this association to be interpreted as the plausibly causal effect of
these policies is that the timing of policy deployment is independent
of infection growth rates. This assumption is supported by
established epidemiological theory and evidence, which indicate
that infections in the absence of policy will grow exponentially early in
the epidemic, implying that pre-policy infection growth rates should be
constant over time and therefore uncorrelated with the timing of policy
deployment. Furthermore, scientific guidance to decision-makers early
in the current epidemic explicitly projected constant growth rates in the
absence of anti-contagion measures, limiting the possibility that
anticipated changes in natural growth rates affected decision-making.
In practice, policies tended to be deployed in response to the high total
numbers of cases (for example, in France), in response to outbreaks
in other regions (for example, in China, South Korea and Iran), after
delays due to political constraints (for example, in the United States 
and Italy) and often with timings that coincided with arbitrary events, 
such as weekends or holidays (see Supplementary Notes for detailed 
chronologies). 

Our analysis accounts for documented changes in COVID-19 testing 
procedures and availability, as well as differences in case detection 
across locations; however, unobserved trends in case detection could
affect our results (see Methods). We analyse estimated case-detection
trends (Extended Data Fig. 2) and find that this potential bias is small,
possibly elevating our estimated no-policy growth rates by 0.026 (7%)
on average.

It is also possible that changing public knowledge during the period
of our study affects our results. If individuals alter their behaviour in
response to new information unrelated to anti-contagion policies,
such as seeking out online resources, this could alter the growth rate of
infections and thus affect our estimates. If increasing availability of
information reduces infection growth rates, it would cause us to
overstate the effectiveness of anti-contagion policies. We note, however,
that if public knowledge is increasing in response to policy actions,
such as through news reports, then it should be considered a pathway
through which policies alter infection growth, not a form of bias.
Investigating these potential effects is beyond the scope of this analysis, but
it is an important topic for future investigations.

Finally, our analysis focuses on confirmed infections, but other 
outcomes, such as hospitalizations or deaths, are also of policy 
interest. Future studies on these outcomes may require additional 
modelling approaches because they are relatively more context- and 
state-dependent. Nonetheless, we experimentally implement our 
approach on the daily growth rate of hospitalizations in France, the 
only country in our sample for which hospitalization data are available 
at the granularity of this study. We find that the total estimated effect of 
anti-contagion policies on the growth rate of hospitalizations is similar
to our estimates for infection growth rates (Extended Data Fig. 6c). 

Methods


Data reporting 

No statistical methods were used to predetermine sample size. The 
experiments were not randomized and the investigators were not 
blinded to allocation during experiments and outcome assessment. 


Data collection and processing 

We provide a brief summary of our data collection processes here; 
further details, including access dates, are provided in the
Supplementary Notes. Epidemiological data, case definitions/testing regimes and
policy data for each of the six countries in our sample were collected
from a variety of in-country data sources, including government
public health websites, regional newspaper articles and crowd-sourced
information on Wikipedia. The availability of epidemiological and 
policy data varied across the six countries, and preference was given 
to the collection of data at the most granular administrative unit level. 
The country-specific panel datasets are at the regional level in France, 
the state level in the United States, the province level in South Korea, 
Italy and Iran, and the city level in China. Owing to data availability, the 
sample dates differ across countries: in China we use data from 16 Janu- 
ary to 5 March 2020; in South Korea from 17 February to 6 April 2020; 
in Italy from 26 February to 6 April 2020; in Iran from 27 February to 
22 March 2020; in France from 29 February to 25 March 2020; and in 
the United States from 3 March to 6 April 2020. Our data sources are 
described in more detail below. 


China. We acquired epidemiological data from an open-source GitHub 
project that scrapes time series data from Ding Xiang Yuan, a Chinese
website that integrates COVID-19 epidemiological data from various 
local governments. We extended this dataset back in time to 10 January 
2020 by manually collecting official daily statistics from the central 
and provincial (Hubei, Guangdong and Zhejiang) Chinese government 
websites. We compiled policies by collecting data on the start dates of 
emergency declarations, travel bans and lockdowns at the city level 
from the ‘2020 Hubei lockdowns’ Wikipedia page and various other
news reports. We suspect that most Chinese cities have implemented 
at least one anti-contagion policy due to their reported trends in in- 
fections; as such, we dropped cities for which we could not identify a 
policy deployment date to avoid miscategorizing the policy status of 
these cities. Thus our results are only representative for the sample of 
115 cities for which we obtained policy data. 


South Korea. We manually collected and compiled the epidemiologi- 
cal dataset for South Korea, based on provincial government reports, 
policy briefings and news articles. We compiled policy actions from 
news articles and press releases from the Korean Centers for Disease 
Control and Prevention, the Ministry of Foreign Affairs and websites 
of local governments. 


Iran. We used epidemiological data from the table ‘New COVID-19
cases in Iran by province’ in the ‘2020 coronavirus pandemic in Iran’
Wikipedia article, which were compiled from data provided by the
Iranian Ministry of Health website (in Persian). We relied on news
reporting and two timelines of pandemic events in Iran to collate policy
data. From 2 March to 3 March 2020, Iran did not report subnational
cases. Around this period, the country implemented three national
policies: a recommendation against local travel (1 March), work from home
for government employees (3 March) and school closure (5 March). As 
the effects of these policies cannot be distinguished from each other 
due to the data gap, we group them together for the purpose of this 
analysis. 


Italy. We used epidemiological data from the GitHub repository
maintained by the Italian Department of Civil Protection (Dipartimento della
Protezione Civile). For policies, we primarily relied on the English
version of the COVID-19 dossier ‘Chronology of main steps and legal acts
taken by the Italian Government for the containment of the COVID-19
epidemiological emergency’ written by the Dipartimento della
Protezione Civile, and Wikipedia.


France. We used the region-level epidemiological dataset provided 
by the government website of France and supplemented it with the
number of confirmed cases by region on the public health website
of France, which was previously updated daily until 25 March. We
obtained data on the policy response to the COVID-19 pandemic from
the French government website, press releases from each regional
public health site and Wikipedia.


United States. We used state-level epidemiological data from usafacts.org,
which are compiled from multiple sources. For policy responses,
we relied on a number of sources, including the US Centers for Disease
Control and the National Governors Association, as well as various 
executive orders from county- and city-level governments, and press 
releases from media outlets. 


Policy data. Policies in administrative units were coded as binary vari- 
ables, for which the policy was coded as either 1 (after the date that the 
policy was implemented and before it was removed) or 0 (otherwise) 
for the affected administrative units. When a policy only affected a frac- 
tion of an administrative unit (for example, half of the counties within 
a state), policy variables were weighted by the percentage of people 
within the administrative unit who were treated by the policy. We used 
the most recent population estimates we could find for the administra- 
tive units of countries (see the ‘Population Data’ section in the Supple- 
mentary Information). To standardize policy types across countries, 
we mapped each country-specific policy to one of the broader policy 
category variables in our analysis. In this exercise, we collected 168 
policies for China, 59 for South Korea, 214 for Italy, 23 for Iran, 59 for 
France and 1,177 for the United States (Supplementary Table 1). There 
are some cases for which we encode policies that are necessarily in 
effect whenever another policy is in place, owing in particular to the 
far-reaching implications of home-isolation policies. In China, wherever 
home isolation is documented, we assume a local travel ban is enacted 
on the same day if we have not found an explicit local travel ban policy
for a given locality. In France, we assume home isolation is accompanied
by event cancellations, social distancing and no-gathering policies; in 
Italy, we assume home isolation entails no-gathering, local travel ban, 
work from home and social distancing policies; in the United States, 
we assume shelter-in-place orders indicate that non-essential business 
closures, work from home policies and no-gathering policies are in 
effect. For policy types that are enacted multiple times at increasing 
degrees of intensity within a locality, we add weights to the variable by 
escalating the intensity from 0 pre-policy in steps up to 1 for the final 
version of the policy (see the ‘Policy Data’ section in the Supplementary 
Information). 
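
The population weighting described above can be illustrated with a minimal sketch. The function name `weighted_policy`, the county names and the population figures are all hypothetical and are not taken from the study's data or code; the sketch only shows how a binary sub-unit policy becomes a value between 0 and 1 for the parent administrative unit.

```python
def weighted_policy(populations, treated):
    """Population-weighted share of an administrative unit under a policy.

    populations: dict mapping sub-unit name -> population
    treated: set of sub-unit names where the policy is in force
    Returns a value between 0 (no one treated) and 1 (everyone treated).
    """
    total = sum(populations.values())
    covered = sum(pop for name, pop in populations.items() if name in treated)
    return covered / total


# Hypothetical state with three counties (populations are made up).
counties = {"A": 600_000, "B": 300_000, "C": 100_000}

# A policy enacted only in county A covers 60% of the state's population.
partial = weighted_policy(counties, {"A"})

# A state-wide policy covers everyone and is coded as 1.
full = weighted_policy(counties, {"A", "B", "C"})

print(partial, full)
```

The same weighted value would then enter the regression in place of a 0/1 indicator for that unit and day.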


Epidemiological data. We collected information on cumulative con- 
firmed cases, cumulative recoveries, cumulative deaths, active cases 
and any changes to domestic COVID-19-testing regimes, such as case 
definitions or testing methodology. For our regression analysis (Fig. 2), 
we use active cases when they are available (China and South Korea) and 
cumulative confirmed cases otherwise. We document quality-control 
steps in the Supplementary Information. For China and South Korea, 
we acquired more granular data than the data hosted on the Johns 
Hopkins University (JHU) interactive dashboard; we confirm that
the numbers of confirmed cases closely match between the two data
sources (see Extended Data Fig. 1). To conduct the econometric analysis,
we merge the epidemiological and policy data to form a single dataset
for each country. 


Econometric analysis 

Reduced-form approach. The reduced-form econometric approach 
that we apply here is a ‘top-down’ approach that describes the
behaviour of aggregate outcomes y in data (in this case, infection rates). This
approach can identify plausibly causal effects induced by exogenous
changes in independent policy variables z (for example, school closure)
without explicitly describing all underlying mechanisms that link z to y,
without observing intermediary variables x (for example, behaviour)
that might link z to y, or without other determinants of y unrelated to
z (for example, demographics), denoted w. Let f(·) describe a complex
and unobserved process that generates infection rates y:

$$ y = f\big(x_1(z_1, \ldots, z_K), \ldots, x_N(z_1, \ldots, z_K), w_1, \ldots, w_M\big) \tag{1} $$

Process-based epidemiological models aim to capture elements of
f(·) explicitly, and then simulate how changes in z, x or w affect y. This
approach is particularly important and useful in forward-looking
simulations in which future conditions are likely to be different from
historical conditions. However, a challenge faced by this approach is that we
may not know the full structure of f(·), for example, if a pathogen is new
and many key biological and societal parameters remain uncertain. We
may not know the effect that large-scale policy (z) will have on behaviour
(x(z)) or how this behaviour change will affect infection rates (f(·)).

Alternatively, one can differentiate equation (1) with respect to the
kth policy $z_k$:

$$ \frac{\partial y}{\partial z_k} = \sum_{j=1}^{N} \frac{\partial y}{\partial x_j} \frac{\partial x_j}{\partial z_k} \tag{2} $$

which describes how changes in the policy affect infections through
all N potential pathways mediated by $x_1, \ldots, x_N$. Usefully, for a fixed
population observed over time, empirically estimating an average
value of the local derivative on the left side in equation (2) does not
depend on explicit knowledge of w. If we can observe y and z directly
and estimate changes over time $\partial y / \partial z_k$ with data, then intermediate
variables x also need not be observed nor modelled. The reduced-form
econometric approach thus attempts to measure $\partial y / \partial z_k$ directly,
exploiting exogenous variation in policies z.


Model. Active infections grow exponentially during the initial phase 
of an epidemic, when the proportion of immune individuals in a 
population is near zero. Assuming a simple susceptible-infected-
recovered (SIR) disease model, the growth in infections during the
early period is

$$ \frac{dI_t}{dt} = (S_t \beta - \gamma) I_t \underset{S_t \to 1}{=} (\beta - \gamma) I_t, \tag{3} $$

where $I_t$ is the number of infected individuals at time $t$, $\beta$ is the
transmission rate (new infections per day per infected individual),
$\gamma$ is the removal rate (proportion of infected individuals recovering
or dying each day) and $S_t$ is the fraction of the population susceptible to
the disease. The second equality holds in the limit $S_t \to 1$, which describes
conditions during the beginning of the COVID-19 pandemic. The 
solution to this ordinary differential equation is the exponential function

$$ I_{t_2} = I_{t_1} e^{g (t_2 - t_1)}, \tag{4} $$

where $I_{t_1}$ is the initial condition and $g = \beta - \gamma$ is the growth rate.
Taking the natural logarithm and rearranging, we have

$$ \log(I_{t_2}) - \log(I_{t_1}) = g (t_2 - t_1). \tag{5} $$


Anti-contagion policies are designed to alter g, through changes to
$\beta$, by reducing contact between susceptible and infected individuals.
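
As a quick numerical check of equation (5), the sketch below generates an exponentially growing series and confirms that the daily difference in log infections equals the growth rate. The parameter values (a transmission rate of 0.45 and a removal rate of 0.10 per day) are illustrative assumptions, not estimates from this study.

```python
import math

beta = 0.45   # assumed transmission rate per day (illustrative only)
gamma = 0.10  # assumed removal rate per day (illustrative only)
g = beta - gamma  # growth rate in the early phase, when nearly everyone is susceptible


def infections(i0, t):
    """Exponential solution of the early-phase SIR equation."""
    return i0 * math.exp(g * t)


# Six consecutive daily observations starting from 10 active infections.
series = [infections(10, t) for t in range(6)]

# First difference of the logarithm between consecutive days.
daily_growth = [math.log(b) - math.log(a) for a, b in zip(series, series[1:])]

print(daily_growth)
```

A policy that lowered the transmission rate in this sketch would lower every subsequent value of the log difference by the same amount, which is the quantity the regression targets.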


Holding the time step between observations fixed at one day ($t_2 - t_1 = 1$),
we thus model g as a time-varying outcome that is a linear function of
a time-varying policy

$$ g_t = \log(I_t) - \log(I_{t-1}) = \theta_0 + \theta \cdot \mathrm{policy}_t + \varepsilon_t, \tag{6} $$

where $\theta_0$ is the average growth rate without a policy, $\mathrm{policy}_t$ is a binary
variable describing whether a policy is deployed at time $t$, and $\theta$ is the
average effect of the policy on growth rate g over all periods subsequent
to the introduction of the policy, thereby encompassing any lagged
effects of policies. $\varepsilon_t$ is a mean-zero disturbance term that captures
interperiod changes not described by $\mathrm{policy}_t$. Using this approach,
infections each day are treated as the initial conditions for integrating
equation (4) through to the following day.
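
A minimal sketch of equation (6) on synthetic data follows. It simulates 30 days of infections in which a single hypothetical policy switches on at day 15, then recovers the no-policy growth rate and the policy effect by ordinary least squares. The helper `ols_simple`, the true parameter values and the noise level are assumptions made for illustration; this is not the study's estimation code.

```python
import math
import random


def ols_simple(x, y):
    """Closed-form OLS for one regressor: returns (intercept, slope)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


random.seed(0)
theta0_true, theta_true = 0.35, -0.20  # assumed values for the simulation
policy = [0] * 15 + [1] * 15           # policy[t] applies to the step t -> t + 1

# Each day's infections are the initial condition for the next day's growth.
infections = [10.0]
for p in policy:
    g_t = theta0_true + theta_true * p + random.gauss(0.0, 0.02)
    infections.append(infections[-1] * math.exp(g_t))

# Dependent variable: first difference of log infections.
growth = [math.log(b) - math.log(a) for a, b in zip(infections, infections[1:])]

theta0_hat, theta_hat = ols_simple(policy, growth)
print(theta0_hat, theta_hat)
```

With a single binary regressor, the intercept is the mean growth rate before the policy and the slope is the change in mean growth after it.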

We compute the first differences $\log(I_t) - \log(I_{t-1})$ using active
infections in countries for which they are available; otherwise we use
cumulative infections, noting that they are almost identical during this early
period (except in China, where we use active infections). We then match 
these data to policy variables that we construct using the novel datasets 
that we assembled and apply a reduced-form approach to estimate a 
version of equation (6), although the actual expression has additional 
terms detailed below. 


Estimation. To estimate a multi-variable version of equation (6), we 
estimate a separate regression for each country c. Observations are for 
subnational units indexed by i observed for each day t. Because not all
localities began testing for COVID-19 on the same date, these samples
are unbalanced panels. To ensure data quality, we restrict our analysis 
to localities after they have reported at least ten cumulative infections. 
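
The sample restriction can be sketched as follows. The tuple layout `(unit, day, cumulative_cases)` and the function name `restrict_sample` are hypothetical; the threshold of ten cumulative infections is the one stated above. Each locality enters the panel on the first day it reaches the threshold, which is why the resulting panel is unbalanced.

```python
from collections import defaultdict


def restrict_sample(rows, min_cases=10):
    """Keep each locality's observations from the first day it has
    reported at least `min_cases` cumulative infections."""
    by_unit = defaultdict(list)
    for unit, day, cum_cases in rows:
        by_unit[unit].append((day, cum_cases))

    kept = []
    for unit, observations in by_unit.items():
        started = False
        for day, cum_cases in sorted(observations):
            started = started or cum_cases >= min_cases
            if started:
                kept.append((unit, day, cum_cases))
    return kept


# Made-up observations for two localities that begin reporting on different days.
rows = [
    ("X", 1, 3), ("X", 2, 12), ("X", 3, 30),
    ("Y", 2, 0), ("Y", 3, 9), ("Y", 4, 15),
]
kept = restrict_sample(rows)
print(kept)
```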

A necessary condition for unbiased estimates is that the timing of
policy deployment is independent of natural infection growth rates, a
mathematical condition that should be true in the context of a new
epidemic. In established epidemiological models, including the standard
SIR model above, early rates of infection within a susceptible population
are characterized by constant exponential growth. This phenomenon
is well understood theoretically, has been repeatedly documented
in past epidemics as well as the current COVID-19 pandemic,
and implies constant infection growth rates in the absence of policy
intervention. Thus, we treat changes in infection growth rates as
conditionally independent of policy deployments since the correlation
between a constant variable and any other variable is zero in expectation.

We estimate a multiple regression version of equation (6) using
ordinary least squares. We include a vector of subnational unit fixed
effects $\theta_{0,ci}$ (that is, varying intercepts captured as coefficients to dummy
variables) to account for all time-invariant factors that affect the local
growth rate of infections, such as differences in demographics,
socioeconomic status, culture and health systems. We include a vector
of day-of-week fixed effects $\delta_{ct}$ to account for weekly patterns in the
growth rate of infections that are common across locations within a
country; however, in China, we omit day-of-week effects because we
find no evidence they are present in the data, perhaps because
the outbreak of COVID-19 began during a national holiday and
workers never returned to work. We also include a separate single-day
dummy variable each time there is an abrupt change in the availability
of COVID-19 testing or a change in the procedure to diagnose positive
cases. Such changes generally manifest as a discontinuous jump in
infections and a re-scaling of subsequent infection rates (for example,
see China in Fig. 1), effects that are flexibly absorbed by a single-day
dummy variable because the dependent variable is the first difference
of the logarithm of infections. We denote the vector of these effects $\mu_{cit}$.

Lastly, we include a vector (length $P_c$) of country-specific policy
variables ($\mathrm{policy}_{pcit}$) for each location and day. These policy variables take on
values between 0 and 1 (inclusive) where 0 indicates no policy action
and 1 indicates a policy is fully enacted. In cases in which a policy
variable captures the effects of collections of policies (for example, museum
closures and library closures), a policy variable is computed for each,
then they are averaged, so the coefficient on this type of variable is 
interpreted as the effect if all policies in the collection are fully enacted. 
There are also instances in which multiple policies are deployed on 
the same date in numerous locations, in which case we group policies 
that have similar objectives (for example, suspension of transit and 
travel ban, or cancelling of events and no gathering) and keep other 
policies separate (that is, business closure and school closure). The 
grouping of policies is useful for reducing the number of estimated 
parameters in our limited sample of data, allowing us to examine the 
impact of subsets of policies (Fig. 2c). However, policy grouping does 
not make a substantial difference to the estimated effect of all policies 
combined nor to the effect of actual policies, which we demonstrate 
by estimating a regression model in which no policies are grouped 
and these values are recalculated (Extended Data Fig. 6a and Supple- 
mentary Table 4). 

In some cases (for Italy and the United States), policy data are avail- 
able at a more spatially granular level than infection data (for exam- 
ple, city policies and state-level infections in the United States). In 
these cases, we code binary policy variables at the more granular level 
and use population weights to aggregate them to the level of the 
infection data. Thus, policy variables may take on continuous values
between 0 and 1, with a value of 1 indicating that the policy is fully enacted
for the entire population. Given the limited quantity of data currently
available, we use a parsimonious model that assumes the effects of
policies on infection growth rates are approximately linear and additively
separable. However, future studies that comprise more data may be able 
to identify important nonlinearities or interactions between policies. 

For each country, our general multiple regression model is thus 


$$ g_{cit} = \log(I_{cit}) - \log(I_{ci,t-1}) = \theta_{0,ci} + \delta_{ct} + \mu_{cit} + \sum_{p=1}^{P_c} \left( \theta_{pc}\, \mathrm{policy}_{pcit} \right) + \varepsilon_{cit} \tag{7} $$


where observations are indexed by country c, subnational unit i and
day t. The parameters of interest are the country-by-policy specific
coefficients $\theta_{pc}$. We display the estimated residuals $\varepsilon_{cit}$ in Extended Data
Fig. 10, which are mean zero but not strictly normal (normality is not
a requirement of our modelling and inference strategy), and we
estimate uncertainty over all parameters by calculating our standard errors
robust to error clustering at the day level. This approach allows the
covariance in $\varepsilon_{cit}$ across different locations within a country, observed
on the same day, to be non-zero. Such clustering is important in this
context because idiosyncratic events within a country, such as a holiday
or a backlog in testing laboratories, could generate nonuniform
country-wide changes in infection growth for individual days that are
not explicitly captured in our model. Thus, this approach
nonparametrically accounts for both arbitrary forms of spatial autocorrelation
and systematic misreporting in regions of a country on any given day
(we note that it generates larger estimates for uncertainty than
clustering by i). When we report the effect of all policies combined (Fig. 2b),
we are reporting the sum of coefficient estimates for all policies $\sum_{p=1}^{P_c} \theta_{pc}$,
accounting for the covariance of errors in these estimates when
computing the uncertainty of this sum.
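
The structure of equation (7) can be sketched on a small synthetic panel. The example below is a simplified illustration under stated assumptions: it keeps only unit fixed effects and two policy variables, omits the day-of-week and testing-regime terms, uses noise-free data, and does not compute the day-clustered standard errors. Unit fixed effects are handled by demeaning within each unit, which gives the same policy coefficients as including unit dummy variables. All names and numbers are hypothetical.

```python
def demean_by_unit(values, units):
    """Subtract each unit's mean; equivalent to including unit dummy variables."""
    sums, counts = {}, {}
    for v, u in zip(values, units):
        sums[u] = sums.get(u, 0.0) + v
        counts[u] = counts.get(u, 0) + 1
    return [v - sums[u] / counts[u] for v, u in zip(values, units)]


def fit_two_policies(g, p1, p2, units):
    """OLS of growth rates on two policy variables with unit fixed effects."""
    y = demean_by_unit(g, units)
    x1 = demean_by_unit(p1, units)
    x2 = demean_by_unit(p2, units)
    # Normal equations for two regressors, solved with Cramer's rule.
    s11 = sum(a * a for a in x1)
    s22 = sum(b * b for b in x2)
    s12 = sum(a * b for a, b in zip(x1, x2))
    s1y = sum(a * c for a, c in zip(x1, y))
    s2y = sum(b * c for b, c in zip(x2, y))
    det = s11 * s22 - s12 * s12
    theta1 = (s1y * s22 - s2y * s12) / det
    theta2 = (s2y * s11 - s1y * s12) / det
    return theta1, theta2


# Hypothetical panel: three units, 20 days, staggered policy start dates.
unit_intercepts = {"north": 0.40, "south": 0.30, "east": 0.35}
start_p1 = {"north": 5, "south": 8, "east": 11}
start_p2 = {"north": 12, "south": 10, "east": 15}
theta1_true, theta2_true = -0.10, -0.15

units, p1, p2, g = [], [], [], []
for unit, intercept in unit_intercepts.items():
    for t in range(20):
        d1 = 1.0 if t >= start_p1[unit] else 0.0
        d2 = 1.0 if t >= start_p2[unit] else 0.0
        units.append(unit)
        p1.append(d1)
        p2.append(d2)
        g.append(intercept + theta1_true * d1 + theta2_true * d2)

theta1_hat, theta2_hat = fit_two_policies(g, p1, p2, units)
combined = theta1_hat + theta2_hat  # effect of all policies combined
print(theta1_hat, theta2_hat, combined)
```

Because the synthetic data contain no noise, the coefficients are recovered up to floating-point error; with real data, the uncertainty of the summed effect would also need the covariance between the coefficient estimates.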

Note that our estimates of $\theta$ and $\theta_0$ in equation (7) are robust to
systematic underreporting of infections, a major concern in the ongoing
pandemic, due to the construction of our dependent variable. This
remains true even if different localities have different rates of
underreporting, so long as the rate of underreporting is relatively constant.
To see this, note that if each locality i has a medical system that reports
only a fraction $\psi_i$ of infections such that we observe $\tilde{I}_{it} = \psi_i I_{it}$ rather than
actual infections $I_{it}$, then the left side of equation (7) will be

$$ \begin{aligned}
\log(\tilde{I}_{it}) - \log(\tilde{I}_{i,t-1}) &= \log(\psi_i I_{it}) - \log(\psi_i I_{i,t-1}) \\
&= \log(\psi_i) - \log(\psi_i) + \log(I_{it}) - \log(I_{i,t-1}) \\
&= \log(I_{it}) - \log(I_{i,t-1}) = g_t
\end{aligned} $$


and is therefore unaffected by location-specific and time-invariant 
underreporting. Thus systematic underreporting does not affect our 
estimates for the effects of policy $\theta$. As discussed above, potential
biases associated with non-systematic underreporting that results
from documented changes in testing regimes over space and time are
absorbed by region-day-specific effects $\mu$.
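
The cancellation above is easy to confirm numerically. In this sketch the true series grows by 40% per day and only a constant fraction of infections is reported; both figures are made up for illustration.

```python
import math


def growth_rates(series):
    """First difference of the logarithm between consecutive observations."""
    return [math.log(b) - math.log(a) for a, b in zip(series, series[1:])]


true_infections = [100, 140, 196, 274.4, 384.16]  # grows 40% per day
detection_fraction = 0.2  # hypothetical constant share of infections reported

observed = [detection_fraction * x for x in true_infections]

g_true = growth_rates(true_infections)
g_observed = growth_rates(observed)
print(g_true, g_observed)
```

The observed counts are five times too small, yet the two lists of growth rates agree, so a constant reporting fraction leaves the dependent variable unchanged.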

However, if the rate of underreporting within a locality is changing
day-to-day, this could bias infection growth rates. We estimate the
magnitude of this bias (Extended Data Fig. 2), and verify that it is
quantitatively small. Specifically, if $\tilde{I}_{it} = \psi_{it} I_{it}$, where $\psi_{it}$ changes day-to-day, then

$$ \log(\tilde{I}_{it}) - \log(\tilde{I}_{i,t-1}) = \log(\psi_{it}) - \log(\psi_{i,t-1}) + g_t, \tag{8} $$

where $\log(\psi_{it}) - \log(\psi_{i,t-1})$ is the day-over-day growth rate of the
case-detection probability. Disease surveillance has evolved slowly in
some locations as governments gradually expand testing, which would
cause $\psi_{it}$ to change over time, but these changes in testing capacity
do not appear to significantly alter our estimates of infection growth
rates. In Extended Data Fig. 2, we show one set of epidemiological
estimates for $\log(\psi_{it}) - \log(\psi_{i,t-1})$. Despite random day-to-day
variations, which do not cause systematic biases in our point estimates, the
mean of $\log(\psi_{it}) - \log(\psi_{i,t-1})$ is consistently small across the different
countries: 0.05 in China, 0.064 in Iran, 0.019 in South Korea, -0.058
in France, 0.031 in Italy and 0.049 in the United States. The average of
these estimates is 0.026, potentially accounting for 7.3% of our global
average estimate for the no-policy infection growth rate (0.36). These
estimates of $\log(\psi_{it}) - \log(\psi_{i,t-1})$ also do not display strong temporal
trends, alleviating concerns that time-varying underreporting
generates sizable biases in our estimated effects of anti-contagion policies.
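
Equation (8) can be checked in the same way. The sketch below assumes a true growth rate of 0.36 per day and a hypothetical detection probability that rises slowly over five days; it then confirms that the bias in the observed growth rate equals the day-over-day growth of the detection probability.

```python
import math

g_true = 0.36  # assumed true daily growth rate of infections
true_infections = [100 * math.exp(g_true * t) for t in range(5)]

# Hypothetical detection probabilities that rise as testing expands.
detection = [0.10, 0.11, 0.12, 0.13, 0.14]
observed = [p * x for p, x in zip(detection, true_infections)]


def log_diff(series):
    """First difference of the logarithm between consecutive observations."""
    return [math.log(b) - math.log(a) for a, b in zip(series, series[1:])]


observed_growth = log_diff(observed)
detection_growth = log_diff(detection)

# Bias in each day's observed growth rate relative to the truth.
bias = [og - g_true for og in observed_growth]
print(bias, detection_growth)
```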


Transient dynamics. In China, we are able to examine the transient re- 
sponse of infection growth rates following policy deployment because 
only three policies were deployed early in a seven-week sample period
during which we observe many cities simultaneously. This provides us 
with sufficient data to estimate the temporal structure of policy effects 
without imposing assumptions regarding this structure. To do this, we 
estimate a distributed-lag model that encodes policy parameters us- 
ing weekly lags based on the date that each policy is first implemented 
in locality i. This means the effect of a policy implemented one week 
ago is allowed to differ arbitrarily from the effect of that same policy 
in the following week, and so on. These effects are then estimated si- 
multaneously and are displayed in Fig. 2b, c (see also Supplementary 
Table 3). Such a distributed lag approach did not provide statistically 
meaningful insights in other countries using the currently available 
data because there were fewer administrative units and shorter
periods of observation (that is, smaller samples), and more policies (that
is, more parameters to estimate) in all other countries. Future studies
may be able to successfully explore these dynamics outside of China. 
As a robustness check, we examine whether excluding the transient
response from the estimated effects of policy substantially alters our
results. We do this by estimating a ‘fixed lag’ model, in which we assume
that policies cannot influence infection growth rates for L days,
recoding a policy variable at time t as zero if a policy was implemented fewer
than L days before t. We re-estimate equation (7) for each value of L and
present results in Extended Data Fig. 5 and Supplementary Table 5.
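
The recoding used in the ‘fixed lag’ model can be sketched as below. The function name `apply_fixed_lag` and the example dates are hypothetical, and the sketch handles only a binary policy with a single start day; it is not the study's code.

```python
def apply_fixed_lag(policy_start_day, days, lag):
    """Code a binary policy as 0 until `lag` days after its start, then 1.

    With lag = 0 this reproduces the original coding (1 from the start day).
    """
    return [1 if t - policy_start_day >= lag else 0 for t in days]


days = range(8)

# Original coding: policy implemented on day 3.
original = apply_fixed_lag(3, days, lag=0)

# Fixed-lag coding: the same policy is assumed to have no effect for 2 days.
lagged = apply_fixed_lag(3, days, lag=2)

print(original)
print(lagged)
```

Re-estimating the regression with the lagged variable for several values of the lag shows how sensitive the estimated effects are to excluding the first days after deployment.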


Alternative disease models. Our main empirical specification is mo- 
tivated with an SIR model of disease contagion, which assumes zero 
latent period between exposure to COVID-19 and infectiousness. If we 
relax this assumption to allow for a latent period of infection, as in a
susceptible-exposed-infected-recovered (SEIR) model, the growth
of the outbreak is only asymptotically exponential. Nonetheless, we
demonstrate that SEIR dynamics have only a minor potential impact
on the coefficients recovered by using our empirical approach in this
context. In Extended Data Figs. 8, 9 we present results from a simulation
exercise that uses equations (9)-(11), along with a generalization to
the SEIR model to generate synthetic outbreaks (see Supplementary
Methods section 2). We use these simulated data to test the ability of our 
statistical model (equation (7)) to recover both the unimpeded growth 
rate (Extended Data Fig. 8) as well as the impact of simulated policies on 
growth rates (Extended Data Fig. 9) when applied to data generated by 
SIR or SEIR dynamics over a wide range of epidemiological conditions. 


Projections 

Daily growth rates of infections. To estimate the instantaneous daily 
growth rate of infections if policies were absent, we obtain fitted values 
from equation (7) and compute a predicted value for the dependent 
variable when all $P_c$ policy variables are set to 0. Thus, these estimated
growth rates $\hat{g}_{cit}^{\text{no policy}}$ capture the effect of all locality-specific factors
on the growth rate of infections (for example, demographics), 
day-of-week effects, and adjustments based on the way in which infec- 
tion cases are reported. This counterfactual does not account for 
changes in information that are triggered by policy deployment, as 
those should be considered a pathway through which policies affect 
outcomes, as discussed in the main text. Additionally, the ‘no policy’ 
counterfactual does not model previously unobserved changes in 
behaviour that might occur if fundamentally new behaviours emerge 
even in the absence of government intervention. When we report an 
average no-policy growth rate of infections (Fig. 2a), it is the average 
value of these predictions for all observations in the original sample. 
Location-and-day-specific counterfactual predictions ($\hat{g}^{\text{no policy}}$),
accounting for the covariance of errors in estimated parameters, are 
shown as red markers in Fig. 3. 
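The counterfactual prediction described above can be sketched as follows, under strong simplifying assumptions: a single locality, one hypothetical policy dummy, a weekend indicator standing in for the day-of-week effects, and ordinary least squares in place of the full specification of equation (7). All data here are synthetic and the variable names are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)
n_days = 200
days = np.arange(n_days)

# Synthetic regressors (assumptions, not the study's data).
policy = (days >= 100).astype(float)       # one policy switched on at day 100
weekend = (days % 7 >= 5).astype(float)    # stand-in for day-of-week effects

# Synthetic daily growth rates of infections.
growth = 0.30 - 0.15 * policy + 0.02 * weekend + rng.normal(0.0, 0.01, n_days)

# Fit the regression: intercept, weekend effect, policy effect.
X = np.column_stack([np.ones(n_days), weekend, policy])
coef, *_ = np.linalg.lstsq(X, growth, rcond=None)

# Counterfactual: keep every covariate except policy, which is set to 0.
X_no_policy = X.copy()
X_no_policy[:, 2] = 0.0
g_no_policy = X_no_policy @ coef
g_with_policy = X @ coef

avg_no_policy = g_no_policy.mean()
avg_with_policy = g_with_policy.mean()
print(f"average no-policy growth rate: {avg_no_policy:.3f}")
print(f"average fitted growth rate with policy: {avg_with_policy:.3f}")
```

The average of `g_no_policy` over all observations plays the role of the average no-policy growth rate reported in Fig. 2a.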


Cumulative infections. To provide a sense of scale for the estimated 
cumulative benefits of effects shown in Fig. 3, we link our reduced-form 
empirical estimates to the key structures in a simple SIR system and 
simulate this dynamical system over the course of our sample. The 
system is defined as the following: 


$$\frac{dS_t}{dt} = -\beta_t S_t I_t \qquad (9)$$

$$\frac{dI_t}{dt} = (\beta_t S_t - \gamma) I_t \qquad (10)$$

$$\frac{dR_t}{dt} = \gamma I_t \qquad (11)$$

where $S_t$ is the susceptible population and $R_t$ is the removed population.
Here $\beta_t$ is a time-evolving parameter, determined by our empirical
estimates as described below. Accounting for changes in $S$ becomes
increasingly important as the size of cumulative infections ($I_t + R_t$)
becomes a substantial fraction of the local subnational population,
which occurs in some no-policy scenarios. Our reduced-form analysis
provides estimates for the growth rate of active infections ($\hat{g}$) for each
locality and day, in a regime where $S_t \approx 1$. Thus we know

$$\left.\frac{d \log I_t}{dt}\right|_{S_t \approx 1} = \hat{g}_t = \beta_t - \gamma \qquad (12)$$

but we do not know the values of either of the two right-side terms,
which are required to simulate equations (9)-(11). To estimate $\gamma$, we
note that the left-side term of equation (11) is

$$\frac{dR_t}{dt} = \frac{d}{dt}\left(\text{cumulative recoveries} + \text{cumulative deaths}\right)$$


which we can observe in our data for China and South Korea. Computing
first differences in these two variables (to differentiate with respect
to time), summing them, and then dividing by active cases gives us
estimates of $\gamma$ (medians: China = 0.11, South Korea = 0.05). These values
differ slightly from the classical SIR interpretation of $\gamma$, because in the
public data that we are able to obtain, individuals are coded as ‘recovered’
when they no longer test positive for COVID-19, whereas in the
classical SIR model this occurs when they are no longer infectious. We
adopt the average of these two medians, setting $\gamma = 0.08$. We use medians
rather than simple averages because low values for $I_t$ induce a long
right tail in daily estimates of $\gamma$ and medians are less vulnerable to this
distortion. We then use our empirically based reduced-form estimates
of $\hat{g}$ (both with and without policy) combined with equations (9)-(11)
to project total cumulative cases in all countries (Fig. 4). We simulate
infections and cases for each administrative unit in our sample beginning
on the first day for which we observe 10 or more cases (for that
unit) using a time step of 4 h. Because we observe confirmed cases
rather than total infections, we seed each simulation by adjusting
observed $I_t$ on the first day using country-specific estimates of case
detection rates. We adjust existing estimates of case underreporting
to further account for asymptomatic infections assuming an
infection-fatality ratio of 0.75%. We assume $R_t = 0$ on the first day. To
maintain consistency with the reported data, we report our output in
confirmed cases by multiplying our simulated $I_t + R_t$ values by the
aforementioned proportion of infections confirmed. We estimate
uncertainty by resampling from the estimated variance-covariance
matrix of all regression parameters. In Extended Data Fig. 7, we show
sensitivity of this simulation to the estimated value of $\gamma$ as well as to
the use of an SEIR framework. In Supplementary Table 6, we show
sensitivity of this simulation to the assumed infection-fatality ratio
(see Supplementary Methods section 1).
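The two steps described above, estimating $\gamma$ from reported recoveries and deaths and then projecting cumulative confirmed cases from daily growth-rate estimates, can be sketched as follows. This is a minimal illustration rather than the authors' code: the function names, the synthetic inputs, the choice to divide by same-day active cases, and the simple Euler update are all assumptions.

```python
from statistics import median


def estimate_gamma(cum_recoveries, cum_deaths, active_cases):
    """Median over days of (daily recoveries + daily deaths) / active cases."""
    rates = []
    for t in range(1, len(active_cases)):
        removed = (cum_recoveries[t] - cum_recoveries[t - 1]) + (
            cum_deaths[t] - cum_deaths[t - 1]
        )
        if active_cases[t] > 0:  # skip days with no active cases
            rates.append(removed / active_cases[t])
    return median(rates)


def project_confirmed_cases(daily_growth, gamma, confirmed_day0,
                            population, detection_rate, steps_per_day=6):
    """Project cumulative confirmed cases with equations (9)-(11).

    daily_growth holds one growth-rate estimate per day; equation (12)
    gives beta_t = g_t + gamma. steps_per_day=6 is a 4 h time step.
    """
    dt = 1.0 / steps_per_day
    i = confirmed_day0 / detection_rate / population  # seed with total infections
    r = 0.0                                           # R = 0 on the first day
    s = 1.0 - i
    cumulative_confirmed = []
    for g in daily_growth:
        beta = g + gamma
        for _ in range(steps_per_day):
            new_infections = beta * s * i * dt
            new_removals = gamma * i * dt
            s -= new_infections
            i += new_infections - new_removals
            r += new_removals
        # Convert simulated I + R back to confirmed cases.
        cumulative_confirmed.append((i + r) * population * detection_rate)
    return cumulative_confirmed


# Synthetic example: 7 recoveries and 1 death per day among 100 active cases.
n = 10
gamma_hat = estimate_gamma([7 * t for t in range(n)], list(range(n)), [100] * n)

# Hypothetical growth-rate paths for one administrative unit.
no_policy = project_confirmed_cases([0.30] * 30, gamma_hat, 10, 1_000_000, 0.1)
with_policy = project_confirmed_cases([0.30] * 10 + [0.05] * 20, gamma_hat,
                                      10, 1_000_000, 0.1)
print(gamma_hat, round(no_policy[-1]), round(with_policy[-1]))
```

Because the susceptible fraction is tracked explicitly, the no-policy path saturates rather than growing exponentially without bound, which is the reason the text gives for simulating the full system.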


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


The datasets generated and/or analysed during the current study are
available at https://github.com/bolliger32/gpl-covid. Future updates
and/or extensions to data or code will be listed at
http://www.globalpolicy.science/covid19.


Code availability 


For easier replication, we have created a CodeOcean ‘capsule’, which
contains a pre-built computing environment in addition to the
source code and data. This is available at
https://codeocean.com/capsule/1887579/tree/v1. Future updates and/or
extensions to data or code will be listed at
http://www.globalpolicy.science/covid19.
conducted this comparison for Chinese provinces and South Korea, for which not assign to particular cities (in China) or provinces (in Korea).

Extended Data Fig. 2 | Estimated trends in case detection over time within
each country. Systematic trends in case detection may potentially bias estimates
of no-policy infection growth rates (see equation (8)). We estimate the potential
magnitude of this bias using data from the Centre for Mathematical Modelling of
Infectious Diseases. Markers indicate daily first differences in the logarithm of
the fraction of estimated symptomatic cases reported for each country over
time. The average value over time (solid line and value denoted in panel title) is
the average growth rate of case detection, equal to the magnitude of the
potential bias. For example, in the main text we estimate that the infection
growth rate in the United States is 0.29 (Fig. 2a), of which growth in case
detection might contribute 0.049 (this figure). Sample sizes are 75 in China, 41 in
Iran, 40 in South Korea, 29 in France, 40 in Italy and 32 in the United States.
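The bias calculation in this caption can be illustrated with a small sketch. The detection series below is synthetic (constructed to grow at 0.049 per day), and treating the detection trend as a simple subtraction from the observed growth rate is one reading of "might contribute", assumed here for illustration.

```python
import math


def mean_detection_growth(fraction_reported):
    """Average daily first difference in log(fraction of cases reported)."""
    diffs = [
        math.log(later) - math.log(earlier)
        for earlier, later in zip(fraction_reported, fraction_reported[1:])
    ]
    return sum(diffs) / len(diffs)


# Synthetic detection fractions over 32 days, rising at 0.049 per day.
fraction_reported = [0.05 * math.exp(0.049 * day) for day in range(32)]

detection_growth = mean_detection_growth(fraction_reported)
observed_growth = 0.29  # no-policy growth rate quoted for the United States
adjusted_growth = observed_growth - detection_growth

print(f"growth in case detection: {detection_growth:.3f}")
print(f"growth net of detection trend: {adjusted_growth:.3f}")
```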
Extended Data Fig. 3 | Robustness of the estimated no-policy growth rate of
infections and the combined effect of policies to withholding blocks of data
from entire regions. a, b, For each country, we reestimated equation (7) using
real data k times, each time withholding one of the k first-level administrative
regions (‘Adm1’, that is, state or province) in that country. Each grey circle is
either the estimated no-policy growth rate (a) or the total effect of all policies
combined (b), from one of these k regressions. Red and blue circles show
estimates from the full sample, identical to the results presented in Fig. 2a, b,
respectively. For each country panel, if a single region is influential, the estimated
value when it is withheld from the sample will appear as an outlier. Samples that
omit an influential region are highlighted with an open pink circle. As in Fig. 2b,
we estimate a distributed lag model for China and display each of the estimated
weekly lag effects (where the pink circle is the same ‘without Hubei’ sample for
lags). The full sample includes 3,669 observations in China, 595 in South Korea,
2,898 in Italy, 548 in Iran, 270 in France and 1,238 in the United States.
Extended Data Fig. 4 | Robustness of the estimated effects of individual
policies to withholding blocks of data from entire regions. Same as
Extended Data Fig. 3, but for individual policies (analogous to Fig. 2c). WFH
which two regions are influential, a second region is highlighted with an open
green circle. The full sample includes 3,669 observations in China, 595 in South
Korea, 2,898 in Italy, 548 in Iran, 270 in France and 1,238 in the United States.
Extended Data Fig. 5 | Evidence to support models in which policies affect
infection growth rates in the days following deployment. Existing evidence
has not demonstrated whether policies should affect infection growth rates in
the days immediately after deployment. It is therefore not clear ex ante
whether the policy variables in equation (7) should be encoded as ‘on’
immediately following a policy deployment. We estimate ‘fixed-lag’ models in
which a fixed delay between the deployment of a policy and its effect is
assumed (see Supplementary Methods section 3). If a delay model is more
consistent with real world infection dynamics, these fixed-lag models should
recover larger estimates for the impact of policies and exhibit better model fit.
a, $R^2$ values associated with fixed-lag lengths varying from 0 to 15 days. Centre
values represent the $R^2$ value in our sample, whiskers are 95% confidence
intervals computed through resampling with replacement. In-sample fit
generally declines or remains unchanged if policies are assumed to have a delay
longer than 4 days. b, Estimated effects for no lag (the model reported in the
main text) and for fixed lags between 1 and 5 days. Centre values represent the
point estimate, error bars are 95% confidence intervals. Estimates generally are
unchanged or shrink towards zero (for example, home isolation in Iran),
consistent with mis-coding of post-policy days as no-policy days. The sample
size is 595 in South Korea, 2,898 in Italy, 548 in Iran, 270 in France and 1,238 in
the United States.
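The fixed-lag comparison can be sketched on synthetic data in which a single hypothetical policy takes effect immediately. The regression below is a two-parameter stand-in for equation (7), and the bootstrap is a plain resampling of days with replacement; the actual specification is given in Supplementary Methods section 3.

```python
import numpy as np


def fit_policy_effect(growth, policy):
    """OLS of growth on an intercept and one policy dummy; returns (effect, R^2)."""
    X = np.column_stack([np.ones(len(growth)), policy])
    coef, *_ = np.linalg.lstsq(X, growth, rcond=None)
    residuals = growth - X @ coef
    r_squared = 1.0 - residuals.var() / growth.var()
    return coef[1], r_squared


def bootstrap_r2_interval(growth, policy, rng, n_boot=500):
    """95% percentile interval for R^2 from resampling days with replacement."""
    n = len(growth)
    values = []
    for _ in range(n_boot):
        idx = rng.integers(0, n, size=n)
        values.append(fit_policy_effect(growth[idx], policy[idx])[1])
    return np.percentile(values, [2.5, 97.5])


rng = np.random.default_rng(1)
n_days, policy_start = 100, 50
days = np.arange(n_days)

# Synthetic truth: the policy lowers growth on the day it is deployed (no delay).
true_policy = (days >= policy_start).astype(float)
growth = 0.30 - 0.15 * true_policy + rng.normal(0.0, 0.01, n_days)

results = {}
for lag in range(0, 6):
    lagged_policy = (days >= policy_start + lag).astype(float)
    results[lag] = fit_policy_effect(growth, lagged_policy)

r2_low, r2_high = bootstrap_r2_interval(growth, true_policy, rng)
for lag, (effect, r2) in results.items():
    print(f"lag {lag}: effect = {effect:.3f}, R^2 = {r2:.3f}")
```

In this synthetic case, imposing a lag that is not in the data mis-codes post-policy days as no-policy days, so the estimated effect shrinks towards zero and the fit worsens, which is the pattern the caption describes.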
Extended Data Fig. 6 | Estimated infection or hospitalization growth rates China. The sample size is 46 observations. c, Same as Fig. 3, but using
scenario. a, The estimated daily growth rates of active (China and South Korea) government stopped reporting cumulative cases after 25 March 2020). The
deployments within each subnational unit (blue) and in a scenario in which no and no-policy predictions is our estimated effect of actual anti-contagion
policies were deployed (red). Identical to Fig. 3, but using an alternative policies on the growth rate of infections (or hospitalizations). The markers are
packages. The sample size is 3,669 in China, 595 in South Korea, 2,898 in Italy, confidence intervals). Black circles are observed changes in log(infections) (or
equation (7) is implemented for a single example administrative unit: Wuhan, units.


[Figure: Extended Data Fig. 7. Panel a, estimated no-policy cases.]

Extended Data Fig. 7 | Sensitivity of estimated averted/delayed infections to
the choice of $\gamma$ and $\sigma$ in an SIR/SEIR framework. The sensitivity of total averted/
delayed cases presented in Fig. 4 to alternative modelling assumptions. We
compute total cases across the respective final days in our samples for the six
countries presented in our analysis. The figure displays how these totals vary
with eight values of $\gamma$ (0.05-0.4) and four values of $\sigma$ (0.2, 0.33, 0.5, $\infty$), where the
final value of $\sigma$ ($\infty$) corresponds to the SIR model. a, The simulated total number
of infections under no policy. b, Same as in a, but using actual policies. c, The
difference between a and b, which is the total number of averted/delayed
infections. d, Same as c, but on a logarithmic scale similar to Fig. 4 (a-c are on a
linear scale, trimmed to show details). Figure 4 uses $\gamma = 0.079$, which we calculate
using empirical recovery/death rates in countries for which we observed them
Extended Data Fig. 8 | Simulating reduced-form estimates for the no-policy
growth rate of infections for different population regimes and disease 
dynamics. We examine the performance of reduced-form econometric 
estimators through simulations in which different underlying disease 
dynamics are assumed (see Supplementary Methods section 2). Each 
histogram shows the distribution of econometrically estimated values across 
1,000 simulated outbreaks. Estimates are for the no-policy infection growth 
rate (analogous to Fig. 2a) when three different policies are deployed at random 
moments in time. The black line shows the correct value imposed on the 
simulation and the red histogram shows the distribution of estimates using the 
regression in equation (7), applied to data output from the simulation. The grey 
dashed line shows the mean of this distribution. The 12 subpanels describe the 
results when various values are assigned to the mean infectious period ($\gamma^{-1}$) and
Extended Data Fig. 9 | Simulating reduced-form estimates for
anti-contagion policy effects for different population regimes and 
assumed disease dynamics. Same as Extended Data Fig. 8, but estimates are 
for the combined effect of three different policies (analogous to Fig. 2b) that 
are deployed at random moments in time. a, Simulations in near-ideal data 
Data collection: Data collection is detailed in the appendix.
Data exclusions: Case data were excluded from dates with < 10 confirmed cases because of concerns about statistical reliability.
The ability of the skin to grow in response to stretching has been exploited in
reconstructive surgery. Although the response of epidermal cells to stretching has
been studied in vitro, it remains unclear how mechanical forces affect their
behaviour in vivo. Here we develop a mouse model in which the consequences of
stretching on skin epidermis can be studied at single-cell resolution. Using a
multidisciplinary approach that combines clonal analysis with quantitative modelling
and single-cell RNA sequencing, we show that stretching induces skin expansion by
creating a transient bias in the renewal activity of epidermal stem cells, while a second
subpopulation of basal progenitors remains committed to differentiation.
Transcriptional and chromatin profiling identifies how cell states and gene-regulatory
networks are modulated by stretching. Using pharmacological inhibitors and mouse
mutants, we define the step-by-step mechanisms that control stretch-mediated tissue
expansion at single-cell resolution in vivo.


In shaping tissue architecture, cells are subject to mechanical forces
arising intrinsically through the cytoskeleton, by cell-cell adhesion,
and from the microenvironment through interaction with the extracellular
matrix (ECM). Cells sense and respond to these cues via the
integrated activation of different signalling pathways. This process of
mechanotransduction eventually leads to changes in cell shape, gene
expression and cell fate. Although studies have shown that
mechanotransduction can dictate cell behaviour in vitro, the underlying
mechanisms that allow organs to sense and mediate mechanical cues
in vivo are only beginning to be revealed.

As the first barrier against the environment, the skin is highly exposed
to mechanical stress. The skin must resist and respond to physical insults,
as well as adapt its shape and size to maintain vital barrier functions.

Mechanical stretch-mediated tissue expansion is a procedure commonly
used in plastic surgery to generate extra skin to repair birth
defects, remove scars or for breast reconstruction. In this procedure, an
inflatable ‘skin expander’ is inserted below the skin and inflated, causing
the expansion of the overlaying skin. During the course of expansion, an
excess of cells must be produced. But do all proliferative cells respond
equally to stretch, or do subpopulations respond differentially? How is
mechanosensation linked to gene transcription, and which transcription
factors relay mechanical stress to control expansion?


Hydrogel induces mouse skin expansion 


To study the cellular and molecular mechanisms that regulate
stretch-mediated expansion in vivo, we established a mouse model that
mimics the procedure used in humans, in which a self-inflating hydrogel
is introduced underneath the skin. The hydrogel is designed to inflate
to a predefined shape and size. The 4-ml hydrogel devices expanded
rapidly, reaching their maximum volume after one day (Extended Data
Fig. 1a–d). During the first two days following expansion, the cell area
was increased and cell density decreased, consistent with stretch. After
four days, these parameters returned to their homeostatic values,
indicating the production of new tissue. At day 2, BrdU incorporation
increased twofold, and after day 4, BrdU incorporation decreased
progressively, reaching control levels by day 14 after expansion (Fig. 1a–e).

To investigate whether the differentiation rate was affected, we
assessed the production of keratin 1⁺ (Krt1⁺) and Krt10⁺ suprabasal
cells following expansion. From day 4 onwards, we observed an increase
in the number of Krt1⁺Krt10⁺ suprabasal layers, demonstrating that
stretch-mediated proliferation couples renewing divisions with
differentiation (Fig. 1f, g).

During morphogenesis and in in vitro cell culture, stretch is commonly
associated with rearrangement of cell-cell junctions. Transmission
electron microscopy (TEM) showed that stretch induced intercellular
spacing and thicker keratin bundles. Desmosomes and hemidesmosomes
remained unchanged (Extended Data Fig. 1e–q). Despite
cellular remodelling, the integrity of the skin barrier was maintained,
as assessed by transepithelial water loss (TEWL) (Extended Data Fig. 1r).
Moreover, the expression of adherens-junction and tight-junction
proteins was unchanged (Fig. 1h, i, Extended Data Figs. 1s–w and 2a–d).
However, following expansion, the tension-sensitive epitope of
α-catenin (α18) was increasingly accessible, and vinculin expression
Fig. 1 | Inflated hydrogel mediates skin expansion. a, Membranous signal
(pink), from paraformaldehyde-perfused Rosa26-mT/mG mice in the control
condition (Ctrl) and day 1 (D1) of the expansion condition (Exp). Scale bars,
15 µm. b, Area of basal cells measured from a. c, Basal cell density. Number of
nuclei per 1,000 µm² (five different independent areas of 40,000 µm² per
mouse). d, Immunostaining for Krt14 (red), BrdU incorporation (green) and
Hoechst staining (nuclei, blue) on whole-mount epidermis. Scale bars, 20 µm.
e, BrdU-positive cells from d. f, Immunostaining for Krt14 (red), Krt1 and Krt10
(green), and Hoechst staining (blue) on tissue sections. Scale bars, 20 µm. g,
Tissue thickness; three independent measurements from at least two sections


was enriched at the adhesion sites, showing that adherens junctions 
are remodelled following stretch (Fig. 1j, k, Extended Data Fig. 2e, f). 

Although inflammation occurred following expansion, blocking 
inflammation by dexamethasone administration did not decrease 
proliferation (Extended Data Fig. 2g-m), suggesting that inflammation 
is not essential to mediate cell proliferation. 


Stretching promotes stem cell renewal 


To define the fate dynamics of epidermal cells during stretch-mediated
skin expansion, we performed clonal analysis on Krt14-creER-Rosa-confetti
mice. As described previously, the steady increase in basal (and
total) clone size over the two-week time course was compensated by a
decrease in clone persistence, such that the total labelled cell fraction
remained constant over time. Furthermore, the clone size distributions
showed an exponential-like dependence (Fig. 2a-f, Extended Data
Fig. 3a, b), consistent with the dynamics of a single equipotent cell
population maintained through population asymmetric self-renewal,
as found in other compartments of skin epidermis.
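The two observations in this paragraph can be expressed as simple calculations. In the sketch below, the labelled cell fraction is taken to be proportional to average clone size multiplied by clone persistence, and the empirical cumulative clone size distribution is compared with an exponential reference curve $e^{-n/\langle n \rangle}$. The clone sizes, the induction fraction and the specific reference curve are illustrative assumptions, not values or formulas taken from the study.

```python
import math


def labelled_fraction(induction_fraction, mean_clone_size, persistence):
    """Labelled cell fraction when surviving clones grow as others are lost."""
    return induction_fraction * mean_clone_size * persistence


def cumulative_distribution(clone_sizes):
    """Fraction of clones containing at least n basal cells, for each n."""
    total = len(clone_sizes)
    return {
        n: sum(1 for size in clone_sizes if size >= n) / total
        for n in range(1, max(clone_sizes) + 1)
    }


# If average clone size doubles while persistence halves, labelling is unchanged.
early = labelled_fraction(0.01, mean_clone_size=1.0, persistence=1.0)
late = labelled_fraction(0.01, mean_clone_size=2.0, persistence=0.5)

# Hypothetical basal clone sizes scored at one time point.
sizes = [1, 1, 2, 2, 2, 3, 4, 4, 6, 8]
mean_size = sum(sizes) / len(sizes)
empirical = cumulative_distribution(sizes)
for n, fraction in empirical.items():
    reference = math.exp(-n / mean_size)  # assumed exponential reference curve
    print(f"n >= {n}: observed {fraction:.2f}, exponential reference {reference:.2f}")
```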

During stretch-mediated expansion, the average clone size was 
increased by 3.4-fold in the basal and 6.8-fold in the total cell content, 
twofold larger than under control conditions. However, in this case, 
the expansion in basal clone size was not compensated by a decrease in
clone persistence (Fig. 2g-k, Methods), indicating that tissue expansion 
does not simply increase proliferation, but also changes the balance 
between renewal and differentiation. 

To understand the changes that take place in cell fate during expan- 
sion, we first considered the homeostatic control condition. Despite 
its exponential form, detailed inspection of the clone size distribution 
revealed evidence of a statistically significant bias towards clones bearing
an even number of basal cells (65 ± 5% (mean ± s.d.) at day 14), a
feature that was also present in the total clone size distribution (57 ± 4%
at day 14) (Fig. 2b). Remarkably, this feature was greatly enhanced in
12-O-tetradecanoylphorbol-13-acetate (TPA)-treated (Extended Data
Fig. 3d-g) and expansion conditions (Fig. 2g, Extended Data Fig. 3h-j),
for both basal (80 ± 11% and 84 ± 6%, respectively, at day 14) and total
(78 ± 11% and 81 ± 6%, respectively, at day 14) clone size, suggesting that
it did not derive from statistical noise or synchrony of division, but 
per mouse. h, j, Staining of the adherens-junction component α-catenin (h) and
the α18 tension-sensitive epitope of α-catenin (α18-catenin) (j) colour-coded
for signal intensity with ImageJ. Protein expression is visualized as a colour
gradient with black indicating no expression and yellow indicating maximum
expression. Scale bars, 10 µm. i, k, Average integrated density signal for
α-catenin (i) and α18-catenin (k). Each data point is the average of five
measurements per mouse. AU, arbitrary units. In f, h, j, the dashed line indicates
the basal lamina. In b, c, e, numbers of cells are in parentheses and n = number
of mice. Two-tailed Mann-Whitney test; mean per mouse ± s.e.m.


may originate from a niche-like organization of tissue. To develop this
idea, we considered an arrangement in which back-skin interfollicular
epidermis (IFE) comprised a mosaic of units, each of which hosts two
basal cells organized in a stem-progenitor cell-like hierarchy with one
basal cell belonging to a renewing (stem cell) compartment and the
other a progenitor cell committed to differentiation through terminal
division and stratification (Fig. 2l, Supplementary Note). To ensure
the prevalence of even cell numbers in clones, we posited that the terminal
division and stratification of progenitor cells is compensated
predominantly by the asymmetric division of stem cells within the
same unit. Further, to account for the expansion of clones beyond two
basal cells, we proposed that correlated cell loss and replacement must
also occur between neighbouring units. From a fit to the clonal data,
we found that this model could predict the distributions of basal and
total clone size with a division rate of once per 4.6 days, while around
4 out of 5 divisions of renewing cells result in asymmetric fate outcome,
a similar figure to that found in other skin compartments on
the basis of a model of intrinsic fate choice. Notably, we found that
a ‘one-progenitor’ model based on the correlated differentiation and
divisions of neighbours could not reproduce the fine features of the
data (Fig. 2l, Extended Data Fig. 3k-p, Supplementary Note).
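The even-clone bias that motivates this model is a simple summary statistic. The sketch below computes the fraction of clones with an even number of basal cells in each mouse and then a mean and standard deviation across mice. The clone sizes are invented for illustration, and summarizing across mice (rather than across clones) is an assumption about how the reported mean ± s.d. values were obtained.

```python
from statistics import mean, stdev


def even_fraction(clone_sizes):
    """Fraction of clones that contain an even number of basal cells."""
    even = sum(1 for size in clone_sizes if size % 2 == 0)
    return even / len(clone_sizes)


# Hypothetical basal clone sizes per mouse at a single time point.
clones_by_mouse = {
    "mouse_1": [2, 4, 1, 2, 6, 2, 3, 4],
    "mouse_2": [2, 2, 4, 1, 2, 8, 2, 3],
    "mouse_3": [4, 2, 2, 2, 1, 6, 4, 2],
}

per_mouse = [even_fraction(sizes) for sizes in clones_by_mouse.values()]
summary_mean = mean(per_mouse)
summary_sd = stdev(per_mouse)
print(f"even-sized clones: {100 * summary_mean:.0f} ± {100 * summary_sd:.0f}%")
```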

In common with control and TPA data, the distribution of clone sizes
during expansion also showed convergence towards an exponential
size dependence (Fig. 2k). We therefore investigated whether a minimal
adjustment of the homeostatic model could predict the clone dynamics.
Taking as an input the measured proliferation rate on the basis of
BrdU incorporation (Fig. 1e), we found that a continuous adjustment
of fate imbalance could account for the average clone size increase, as 
well as the enrichment of even-sized clones, providing evidence that 
progenitor heterogeneity is conserved during expansion (Extended 
Data Fig. 4a-g, Supplementary Note). 

To further challenge the two-progenitor model and the transient 
changes in cell fate following skin expansion, we marked cells during 
division using a BrdU single-pulse-chase assay. We assessed the fate of 
proliferative cells during stretch, discriminating between daughter cells 
biased for renewal (Krt14⁺Krt10⁻) and those committed to differentiation
(Krt14⁺Krt10⁺). We found that stretching increased the proportion
of BrdU⁺Krt14⁺Krt10⁻ cells, consistent with the model, demonstrating
Fig. 2 | Clonal analysis of epidermal stem cells during stretch-mediated
skin expansion. a, Krt14-creER-Rosa-confetti clones (n = 4 independent
experiments). Second harmonic generation (SHG) imaging visualizes collagen
fibres (white). Nuclei stained with 7-aminoactinomycin D (7AAD) (blue). Scale
bars, 50 µm. b-k, Clonal analysis in control and expansion conditions.
b, g, Distribution of clone sizes at day 14 based on basal and total cell number.
c, h, Average clone size based on basal (black) and total (blue) cell content.
d, i, Clone persistence. e, j, Average labelled cell fraction. f, k, Cumulative clone
size distribution at day 14 showing an approximate exponential size
dependence (lines). In b-f: day 0, 115 clones from n = 7 mice; day 2, 175 clones
from n = 7 mice; day 4, 136 clones from n = 5 mice; day 8, 159 clones from n = 3
mice; day 10, 146 clones from n = 3 mice; day 14, 195 clones from n = 4 mice.
In g-k: day 2, 231 clones from n = 4 mice; day 4, 197 clones from n = 4 mice; day 8,
199 clones from n = 4 mice; day 10, 157 clones from n = 4 mice; day 14, 199 clones
from n = 4 mice. l, Schematic showing the cellular organization in a
one-progenitor model and the proposed two-progenitor model of back-skin
IFE. In the two-progenitor model, the epidermis contains renewing stem cells,
committed cells and suprabasal cells. In homeostasis, stem cells divide at an
average rate λ. This division has a probability of 1 − r of an asymmetric fate
outcome, leading to the replacement of the partner committed cell, which in
turn is lost through terminal division and stratification from the basal layer.
The remaining divisions lead to the correlated loss and replacement of
renewing cells through symmetric cell divisions. Probability is shown in
parentheses. In c-f, h-k, data are mean ± s.d. In c-e, h-j, points show data and
lines show results of a two-progenitor model (see Supplementary Note).


that expansion promotes an imbalance of the renewing population 
towards cell duplication (Extended Data Fig. 4h, i). 


Molecular features related to stretch 


To define the changes in gene expression following expansion, 
we performed microarray analysis on basal cells isolated by 
fluorescence-activated cell sorting (FACS) in different conditions 
(Extended Data Fig. 5a). Genes upregulated in both expanded and TPA 
conditions comprise those regulating cell cycle, DNA replication, cell 
survival and cytoskeleton remodelling (Extended Data Fig. 5b, c). Many 
of these genes are also upregulated during wound healing, suggesting
that they represent a common transcription program linked to cellular
stress and proliferation. 

Genes that were preferentially upregulated in the expanded condi- 
tion were enriched for those related to cell-cell and cell-ECM adhesion, 
small GTPase, regulators of the actomyosin cytoskeleton (Fig. 3a) and
genes regulating proliferation, including Egfr, Ras, mitogen-activated 
protein kinase (MAPK), activator protein 1 (AP1), transcription factors 
Junb, Fos, Fosb and Fosl1, and components of the YAP-TEAD pathway 
(for example, Tead1 and the canonical YAP target gene Cyr61) (Fig. 3b). 
Consistently, we found changes in the expression of some of these 
adhesion and cytoskeleton proteins by immunostaining and FACS 
analysis (Extended Data Fig. 5d-f). 

To unravel the changes in the chromatin landscape that are associated 
with expansion, we used an assay for transposase-accessible chromatin 
using sequencing (ATAC-seq) to identify the chromatin regions that 
were remodelled at day 2 following expansion. To define the transcrip- 
tion factors associated with chromatin remodelling, we performed 
motif-discovery analyses on the chromatin-remodelled regions. The 
most frequent motifs associated with open chromatin regions corresponded
to AP1, p63, STAT, ETS, CEBP, AP2A and GRHL2 (Fig. 3c,
Extended Data Fig. 5g-k, Supplementary Tables 1, 2). Immunostaining
confirmed overexpression of members of the AP1 transcription factor
family such as FOSL1, FOS and JUN, as well as p63, pSTAT3 and KLF4 in
the basal and early suprabasal cells (Fig. 3d–f, Extended Data Fig. 5l–t).
These results show that stretch-mediated skin expansion is regulated by
the EGFR-Ras-MAPK pathway, leading to the activation of JUN and FOS
transcription factors, transcription factors such as p63 that mediate
epidermal stem cell renewal, as well as transcription factors associated
with differentiation, such as CEBP, KLF4 and GRHL2, allowing skin
expansion while maintaining skin barrier functions.


scRNA-seq during stretching 


To assess whether all basal cells respond equally to stretch, we performed
single-cell RNA sequencing (scRNA-seq) on FACS-isolated cells
enriched for basal IFE, infundibulum, sebaceous gland and basal upper
hair follicle cells in control conditions (n = 4,659), after TPA treatment
(n = 4,342) and following expansion (day 1, n = 4,934 and day 4, n = 2,716)
(Extended Data Fig. 5a). Using graph-based clustering, we identified
different subpopulations on the basis of marker gene expression. These
different clusters were found in all conditions (Extended Data Fig. 6a-l),
and were similar to the clusters described in a previous study.

On the basis of the clonal analysis, we first questioned whether the
scRNA-seq data supported the existence of proliferative heterogeneity
in the IFE. When only IFE cells were analysed in control conditions,
we identified a population of undifferentiated cells with a stem
cell-like signature, a population of proliferative basal committed cells
co-expressing basal and early differentiation markers such as Krt1 and
Krt10, and a population of non-cycling cells expressing differentiation
markers (Fig. 3g). The same IFE populations were found in all conditions,
although the proliferative activity of stem cells and committed
cells were proportionately increased in expansion and TPA conditions
(Fig. 3g, Extended Data Fig. 7a–f, Supplementary Table 3).

In addition, cell states expressing genes associated with stress and 
hyperproliferation (for example, Krt6a, Sprr1a, S100a8, Klk10) were
found in the expanded condition (Fig. 3g, Extended Data Fig. 7g).
Notably, the stem cell-like ‘stretch’ cluster appearing on day 1 following
expansion, identified by the expression of basal markers (for
example, Krt14, Itgb1) and higher expression of Ly6a, H2-K1, Thy1 and
Mt2 (Fig. 3g-i, Extended Data Fig. 7h), presented increased activity
of transcription factors regulating proliferation and immediate early
genes (for example, E2f1 and Egr1) as well as AP1 transcription factors
(for example, Fos, Junb and Jund), inflammation (for example, Stat
and Stat3) and commitment or differentiation (for example, Klf4),
as analysed by SCENIC analysis (Fig. 3j, Extended Data Fig. 8a,
Supplementary Tables 4-6), suggesting that only a fraction of basal cells
respond transcriptionally to mechanical stress. 

To determine the lineage trajectories among the different sub- 
populations, we performed pseudotemporal ordering of cells using 
Fig. 3 | Transcriptional and chromatin remodelling associated with
stretch-mediated skin expansion. a, b, mRNA expression of genes
upregulated in day 4 of expansion conditions (n = 3) compared with
TPA-treated cells (n = 2). Bars are mean with s.e.m. of the fold change over
the average value of the control (n = 3). Cyr61 is also known as Ccn1. Prolif.,
proliferation. c, Transcription factor motifs enriched in the ATAC-seq
peaks that were upregulated on day 2 of expansion conditions compared
with control (n = 3,262 target sequences, 46,200 background sequences)
as determined by Homer analysis using known motif search.
d, Immunohistochemistry for FOSL1. e, f, Immunofluorescence for p63 (e)
or KLF4 (f) and Krt14, and Hoechst staining. In d-f, dashed lines delineate the
basal lamina. Scale bars, 20 µm. n = 3 independent experiments. g, Uniform
manifold approximation and projection (UMAP) graphic of the clustering
analysis for scRNA-seq of the control (n = 3,142 cells) and expansion day 1
(n = 3,756 cells) IFE projected on an integrated embedding of the dataset.
h, Violin plots of the indicated genes at expansion day 1 in the stem cell (SC)
(n = 700 cells) and stem cell-like stretch (SC stretch) (n = 801 cells) clusters (see
Source Data). i, UMAP plots coloured by normalized gene expression values
for the indicated genes in the control and expansion day 1 IFE. Gene expression
is visualized as a colour gradient with grey indicating no expression (that is,
expression below the 50th percentile across each respective sample) and
yellow indicating maximum (max) expression. j, UMAP plots coloured by the
degree of regulon activation for transcription factors differentially activated
(area under the curve (AUC) rank-sum test FDR corrected P-value < 0.05) in the
different conditions. Colour scaling represents the normalized AUC value of
target genes in the regulon being expressed as computed using SCENIC.
k, l, Lineage trajectories (black lines) computed using Slingshot. Clusters
are coloured as in g. In i–l, n = 3,142 cells (control) and n = 3,756 cells (expansion
day 1).


Slingshot” in control and expansion day 1 conditions. In all condi- 


tions, we found atrajectory starting from undifferentiated basal stem 
cells, passing through the progenitor state, and terminating with the 
most differentiated cells (Fig. 3k, |, Extended Data Fig. 8b). In expan- 
sion day 1 conditions, we resolved another differentiation pathway 
involving progression through the stress states, delineating the dif- 
ferent responses to mechanical cues (Fig. 3l, Extended Data Fig. 8c, d). 
Together, these data show that stem cells present a rapid and profound 
response to stretch. 


Mechanosensing at adherens junctions 


The results of TEM and molecular analysis suggested that stretching
induces remodelling of the cytoskeleton, as well as an increase at
the transcription level of several genes encoding remodellers of the
actomyosin cytoskeleton, such as Diaph2 and Diaph3, and formin-like
proteins involved in actin regulation’’ (Fig. 3a, Extended Data Fig. 1e–g,
k, Extended Data Fig. 5c, k). To test whether rearrangements of actin
organization are essential to mediate the response to stretch, we stud- 
ied the consequence of Diaph3 deletion (Diaph3 conditional knockout 
(cKO)). The increase in the number of cells presenting F-actin struc- 
tures on their apical surface on day 1 of expansion was reduced, and no 
increase in cell proliferation following stretch was observed in Diaph3 
cKO mice (Fig. 4a, Extended Data Fig. 9a-g). 

Mechanical stretch has been shown to promote the phosphorylation
of myosin II, relaying mechanotransduction in cell lines in vitro? and in
the Drosophila wing disc”’. Consistently, following conditional ablation
of Myh9, a key subunit of myosin IIA (Myh9 cKO), proliferation was not
increased in response to stretch (Fig. 4b, Extended Data Fig. 9h–l).

As basal cells sense stretching by remodelling the traction at adherens
junctions, we assessed whether Diaph3 and Myh9 cKO prevent adhesion
remodelling. The accessibility of the α-catenin tension-sensitive
epitope and vinculin staining were not increased following stretching
in Diaph3 and Myh9 cKO mice (Extended Data Fig. 9m–r). Deletion of
Diaph3 and Myh9 following expansion resulted in the incapacity of
the epidermis to adapt to stretch, causing a barrier defect (Fig. 4c).


MEK-ERK-AP1 regulate expansion

Previous studies demonstrated that in vitro mechanical stretching
induces proliferation by activation of MEK signalling’, and that ERK is
activated on stress fibres in a myosin II-dependent manner®. To assess
whether pharmacological inhibition of the MEK-ERK pathway could
impair the cellular behaviour following skin stretching, we treated mice
with the MEK inhibitors trametinib and pimasertib. Both inhibitors
reduced phosphorylation of ERK and resulted in a decrease of cell
proliferation and the number of differentiated cells induced by stretching
(Extended Data Fig. 10a–g), demonstrating the importance of
MEK-ERK-AP1 signalling in the regulation of the stretch response in vivo.


YAP and MAL regulate skin stretching 


Previous studies have shown that YES-associated protein 1 (YAP1), the 
downstream coeffector of Hippo signalling, and megakaryoblastic 
leukaemia/myocardin-like 1 (MAL, also known as MRTFA or MKL1), 
the co-factor of serum response factor (SRF), are induced in response 
to mechanical stimuli®”*?. In the skin, YAP-TAZ and SRF pathways are 
essential for epidermal development and wound repair, but are dis- 
pensable for homeostasis**®. In vitro studies have demonstrated the 
role of MAL-SRF in the survival and differentiation of keratinocytes 
cultured on micropatterned surfaces*, and that of YAP as a driver of 
proliferation in stretched cultured cells in vitro”. 

To test whether and when YAP and MAL are activated in vivo, we
assessed their subcellular localization at different times during
expansion. Whereas YAP and MAL were localized preferentially in the
cytoplasm in the control, they gradually translocated to the nucleus
of basal cells during expansion (Extended Data Fig. 10h–k). These data
reveal that YAP and MAL are activated immediately following stretching,
but only in a subset of basal cells, similar to AP1 family members
(Extended Data Fig. 10l).

Fig. 4 | Molecular regulation of stretch-mediated skin expansion.
a, BrdU-positive cells in control and expansion conditions in Diaph3
wild-type (WT) or cKO mice. b, BrdU-positive cells in control and
expansion conditions in Myh9 WT and cKO mice. c, TEWL measurements from
n = 2 Diaph3 WT, n = 2 Myh9 WT (black), n = 3 Diaph3 cKO (pink) and
n = 3 Myh9 cKO mice (violet). d, BrdU-positive cells in control and
expansion conditions in Yap1-Taz WT or cKO mice. e, TEWL measurements
from n = 2 Yap1-Taz WT mice (black) and n = 3 Yap1-Taz cKO mice (green).
f, BrdU-positive cells in untreated or treated mice with the MAL
inhibitor in control and expansion conditions. g, Immunostaining for
YAP1 (top) and MAL (bottom) on skin sections of Diaph3 cKO and Myh9 cKO
mice at expansion day 4. The control images are from a Diaph3 WT (top)
mouse and a Myh9 WT (bottom) mouse. Dashed lines delineate the basal
lamina. Scale bars, 20 μm. h, i, YAP1 (h) and MAL (i) subcellular
localization in skin sections represented in g. Data are mean ± s.e.m.
N > C, more protein in nucleus than in cytoplasm; N = C, similar level
of protein in nucleus as in cytoplasm; N < C, less protein in nucleus
than in cytoplasm. n = 150 cells per condition. j, UMAP computed on the
integrated single-cell RNA-seq dataset coloured for the different
cellular clusters in IFE. n = 3,869 cells (expansion day 1 untreated);
n = 4,762 cells (expansion day 2 MAL inhibitor); n = 3,254 cells
(expansion day 2 trametinib). k, Quantification of the cellular clusters
in j. In a–f, data are mean ± s.e.m.; n = number of mice; total numbers
of cells analysed are indicated in parentheses. In a, b, d, f,
two-tailed Mann-Whitney test. In c, e, each data point represents the
average of 30 individual subsequently recorded measurements at the
probe head.

To define the functional role of YAP, we induced deletion of Yap1 and
Taz specifically in the epidermis (Yap1-Taz cKO) and assessed their
role during stretch. A significant decrease in BrdU incorporation was
observed already by day 2 in Yap1-Taz cKO epidermis, as well as a
reduction in epidermal thickness (Fig. 4d, Extended Data Fig. 10m–q). In
addition, the deletion of Yap1 and Taz following hydrogel expansion
caused a barrier defect (Fig. 4e).

To assess the role of MAL activity, we treated the mice immediately
after surgery with the pharmacological MAL-SRF inhibitor CCG203971,
and found a significant decrease of BrdU incorporation and epidermal
thickness at day 4 (Fig. 4f, Extended Data Fig. 10r–u). Notably,
inhibition of both pathways completely blocked the response to stretch
(Extended Data Fig. 11a–d).

As Diaph3 and Myh9 control adhesion remodelling upon stretching, 
we tested whether they act upstream of YAP1 and MAL. In Diaph3 cKO 
and Myh9 cKO epidermis, YAP1 and MAL were not translocated into the 
nucleus following stretch (Fig. 4g–i, Extended Data Fig. 11e, f),
demonstrating the essential role of Diaph3 and Myh9 in regulating canonical
mechanotransducers. Together, these experiments demonstrate that 
mechanical stretch couples cytoskeletal contractility with the nuclear 
effectors of mechanotransduction and stem cell activation in vivo. 


scRNA-seq after MEK and MAL inhibition 


To define the consequences of the inhibition of the signalling pathways
that control stretch-induced proliferation on the cellular heterogeneity
and stem cell dynamics in vivo, we performed short-term lineage
tracing of basal cells using BrdU pulse-chase combined with
immunohistochemistry. The results showed a decrease in renewing divisions
(BrdU⁺Krt14⁺Krt10⁻) in the YAP-TAZ cKO mice and upon inhibition of
MAL-SRF or MEK-ERK-AP1 (Extended Data Fig. 11g–l). In addition,
short-term genetic lineage tracing in the presence of MAL-SRF or
MEK-ERK-AP1 inhibition showed suppression of the increase of clone size
following skin expansion (Extended Data Fig. 12a–f). scRNA-seq at day
2 after expansion in the presence of the MEK-ERK-AP1 and MAL-SRF
inhibitors showed that, although the inhibition of both pathways led
to a similar decrease in cell proliferation, the proportion of cells in
different states differed between treatments. Cellular heterogeneity was
relatively well conserved following the inhibition of the MEK-ERK-AP1
pathway, whereas the MAL-SRF inhibitor diminished the abundance
of stem cells with stress characteristics and decreased the heterogeneity
of basal cells (Fig. 4j, k, Extended Data Fig. 12g, h, Supplementary
Tables 7–9), suggesting that while the MEK-ERK-AP1 pathway controls
only proliferation induced by stretching, the MAL-SRF pathway
controls cellular heterogeneity in the basal layer and stretch sensing.


Discussion 


We have dissected the cellular and molecular spatiotemporal mechanisms
that control the behaviour of epidermal stem cells in response to
mechanical stretch. Quantitative modelling of clonal data suggests that
proliferation and differentiation of basal cells in back-skin epidermis
are spatially and temporally linked, as has been suggested during
homeostasis”**. During stretch-mediated skin expansion, a transient shift
of stem cell fate towards renewal allows the basal population to expand
while differentiation is maintained. These findings were supported by
scRNA-seq, which revealed the existence of different cell
states within the basal layer, identifying the molecular signature of
stem and committed cells in the mouse back skin, and highlighting the
heterogeneous response of basal cells to stretching.

Mechanistically, we found that regulators of the actomyosin
cytoskeleton, including formin-like proteins and non-muscle myosin, are
essential for sensing skin stretching in vivo, and act upstream of
canonical mechanotransducers such as YAP1 and MAL. Of note, the same
signalling pathways are activated during embryonic pancreas development*”
and in the fly notum upon mechanical compression“, suggesting that
these signalling pathways and transcriptional regulators have a
conserved role in mechanotransduction across the animal kingdom and
between embryonic development and adult tissue regeneration.
Methods


Mouse strains 

Krt14-creER (ref. ”) and Krt14-cre (ref. *) transgenic mice were
provided by E. Fuchs (Rockefeller University). Rosa-Confetti mice** were
provided by H. Clevers (Hubrecht Institute). Yap fl/fl and Taz fl/fl mice*
were obtained from G. Halder (KU Leuven) who received them from
R. Johnson at the MD Anderson Cancer Center, Houston. Diaph3 fl/fl
mice** were produced by F. Tissir (UCLouvain) and Myh9 fl/fl mice? were
a gift from S. Wickström (University of Helsinki). Rosa26 mT/mG (ref. *”)
mice were provided by I. Migeotte (Université Libre de Bruxelles).
Mouse colonies were maintained in a certified animal facility in
accordance with European guidelines. The experiments were approved by
the local ethical committee (CEBEA) under protocols #604 and #605. 
The study is compliant with all relevant ethical regulations regarding 
animal research. 


Expander experiments 

Mice were anaesthetized (5% xylazine, 10% ketamine in PBS) and the
dorsal skin was disinfected with 10% iso-Betadine (Meda Pharma).
An incision was created in the most caudal part of the dorsal skin to
minimize any tension in the wound as the expansion progressed and
to maximize the distance from the access wound to the location of
the expander. A subcutaneous pocket was created with forceps and
a 4-ml Hemisphere self-inflating tissue expander (Osmed) was placed
in the most rostral part under the dorsal skin in proximity of the neck.
Stitches were used to close the subcutaneous pocket, to limit the
hydrogel movement and to close the access incision.

All analyses were performed in the area on the top of the dome of the
hemisphere induced by the skin expander, since this region experiences
the highest strain. The hemispherical architecture of the hydrogel has,
by its geometric nature, a higher degree of mechanical load at the apex
of the hemisphere with isotropic stress* and only the skin overlying
the very upper part of the hydrogel was studied.


Sample size, randomization and blinding 

The sample size was chosen based on previous experience in the labora- 
tory, for each experiment to yield high power to detect specific effects. 
No statistical methods were used to predetermine sample size. The 
experiments were not randomized. All animals used were of similar 
age (60-90 days after birth) and between 28 and 33 g of weight. The 
investigators were not blinded to allocation during experiments and 
outcome assessment. 


Skin whole-mount 

For skin whole-mount confocal microscopy on induced
Krt14-creER-Rosa-Confetti mice, pieces of entire skin of around 1 cm² were
cleared with ScaleCUBIC-1 and ScaleCUBIC-2 solutions as described in”.
The pieces were fixed overnight in 4% paraformaldehyde, incubated 3 days
at 37 °C in ScaleCUBIC-1 solution and incubated at room temperature in
ScaleCUBIC-2 solution for at least 2 days before their analysis.
Counterstaining of nuclei was performed with 7AAD (1/1000, Thermo
Fisher Scientific). Images were acquired at LiMiF (Université Libre de
Bruxelles, Faculté de Médecine, Campus Erasme) on a LSM780NLO
confocal system fitted on an Observer Z1 inverted microscope (Zeiss)
equipped with a Chameleon Vision II 690-1064 nm multiphoton laser
(Coherent Europe) using a Plan Apochromat 20x/0.8 dry objective or
LD C Apochromat 40x/1.1 water immersion objective (Zeiss). Spectral
images were acquired in lambda mode with a single excitation
wavelength at 920 nm and a GaAsP spectral detector (Zeiss). Images 512 by
512 pixels were acquired as a z-stack across the thickness of the object.
Fluorochromes were separated by linear unmixing using ZEN2012
software (Zeiss). Settings were kept identical for all conditions; this
does not apply to the depth (z stack) that was adjusted individually to
encompass the region of interest. Single plane images were displayed
using Zen2012 (Black Edition) software (Zeiss) and exported as
uncompressed TIFF images.


Clonal induction in Krt14-creER-Rosa-Confetti mice

For lineage-tracing experiments, Krt14-creER-Rosa-Confetti mice were
induced at 2 months of age with tamoxifen (Sigma) (0.1 mg/g diluted
in 10% vol/vol ethanol and sunflower seed oil, Sigma) by
intraperitoneal injection.


Inducing Yap1-Taz and Myh9 deletion 

For Yap1-Taz deletion in Krt14-creER Yap1 fl/fl Taz fl/fl mice and for Myh9
deletion in Krt14-creER Myh9 fl/fl mice, animals received a daily dose of
2.5 mg of tamoxifen (Sigma-Aldrich) per 30 g of body weight, diluted
in a mixture of 10% ethanol (VWR) and 90% sunflower oil (Sigma-Aldrich),
by intraperitoneal administration, for 5 consecutive days and every
other day after surgery.


Mouse treatments 

For TPA treatments, TPA (200 μl of 0.02 mg/ml solution in acetone)
was administered daily to shaved mouse back skin for 2 days. To inhibit
MEK1/2 activity, mice were treated with trametinib (2 mg per kg body
weight, by daily oral gavage) or pimasertib (20 mg per kg body weight,
by daily oral gavage) for at least 7 consecutive days before analysis or
as indicated in the figures. To inhibit MAL activity, mice received drug
treatment via intraperitoneal injection daily using 100 mg CCG203971
(Cayman Chemical) per kg body weight for 2 or 4 days as indicated in the
figures. For the dexamethasone experiments, dexamethasone powder
(Sigma) was resuspended at 1 mg ml⁻¹ in 100% ethanol and diluted 5×
in sterile PBS. The mice were injected intraperitoneally once per day at
the dose of 1 mg kg⁻¹. The treatment started 2 days before the surgery to
place the expander and was sustained until the end of the experiment.


Monitoring hydrogel expansion, skin growth, quantification of 
clone size and persistence 

To measure the increase in the hydrogel volume, the height or radius
of the expander (r) was measured every day (D) from D0 to D14 and
the volume calculated as the volume of a half sphere with the formula
$V = (2/3) \times \pi r^{3}$.
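
The half-sphere volume calculation can be sketched in a few lines of Python. This is only an illustrative sketch: the function name, the choice of millimetres as the unit and the example radii are assumptions made here, not values or code from the study.

```python
import math


def hemisphere_volume(radius_mm: float) -> float:
    """Volume of a half sphere, V = (2/3) * pi * r^3.

    With the radius in mm the result is in mm^3, which equals microlitres.
    """
    if radius_mm < 0:
        raise ValueError("radius must be non-negative")
    return (2.0 / 3.0) * math.pi * radius_mm ** 3


# Hypothetical daily radius readings in mm (illustrative values only).
daily_radius_mm = {0: 5.0, 1: 8.0, 2: 10.0}

# Convert mm^3 (= microlitres) to millilitres for each day.
volumes_ml = {
    day: hemisphere_volume(r) / 1000.0 for day, r in daily_radius_mm.items()
}
```

Because the volume scales with the cube of the radius, small errors in the daily radius reading are amplified roughly threefold in relative terms in the computed volume.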

To measure basal cell area, whole-mount tissues were obtained
from Rosa26 mT/mG mice that were fixed by paraformaldehyde perfusion.
Images were acquired by two-photon confocal microscopy (40x
objective, z = 1 μm). Basal cells were identified based on the orthogonal
view and if their basal side was in direct contact with the signal of the
second harmonic generation. Cell area was measured using ImageJ after
manually drawing the contour of each cell revealed by the fluorescence
on the membrane on the Rosa26 mT/mG mice in the x-y plane.

To define the clone persistence, hair follicles were used as landmarks
to calibrate the degree of expansion. During expansion, the total
number of hair follicles is not changed, implying that, for an area
expansion by a factor $\epsilon$, the hair follicle density must fall by a
factor $1/\epsilon$. If the area $a$ of each hair follicle remains
unchanged during the expansion, for a patch of tissue of area $A$
containing hair follicles spanning an area $A_{\mathrm{hf}}$, the
fractional increase in area during the expansion is given by
$\epsilon = A^{0}_{\mathrm{hf}}/A_{\mathrm{hf}}$, where
$A^{0}_{\mathrm{hf}}$ is the corresponding area of hair follicles in the
control animal. Then, with an average of $n$ clones in an area $A$, the
fractional change in clonal persistence is given by $\epsilon n/n_{0}$,
where $n_{0}$ represents the corresponding clone number in the control
animals. For clonal persistence, large areas of tissue (at least 1 cm²
per animal) were acquired using the 25x objective and the number of
basally attached clones was quantified on the two-photon confocal images
in a fixed area of 4 mm² across the different time points.
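
The hair-follicle calibration described above amounts to two ratios, sketched below in Python. The function names and the numerical inputs are hypothetical and serve only to illustrate the arithmetic; they are not measurements from the study.

```python
def area_expansion_factor(hf_area_control: float, hf_area_expanded: float) -> float:
    """Expansion factor epsilon = A0_hf / A_hf.

    Both hair-follicle areas must be measured in patches of the same
    total area A (control and expanded skin, respectively).
    """
    if hf_area_control <= 0 or hf_area_expanded <= 0:
        raise ValueError("hair-follicle areas must be positive")
    return hf_area_control / hf_area_expanded


def clonal_persistence_change(n_clones: float, n_clones_control: float,
                              epsilon: float) -> float:
    """Fractional change in clonal persistence, epsilon * n / n0."""
    if n_clones_control <= 0:
        raise ValueError("control clone number must be positive")
    return epsilon * n_clones / n_clones_control


# Hypothetical example: hair follicles cover 0.40 mm^2 of a control patch
# and 0.25 mm^2 of an equally sized patch of expanded skin.
epsilon = area_expansion_factor(0.40, 0.25)

# Hypothetical clone counts in the same fixed area: 30 (expanded), 50 (control).
persistence = clonal_persistence_change(30, 50, epsilon)
```

Multiplying by the expansion factor corrects for the dilution of clones caused by tissue growth alone, so that a value below 1 reflects genuine clone loss and not simply a larger area.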

To quantify the size of the clone, whole-mounts obtained from
Krt14-creER-Rosa-Confetti mice were imaged by two-photon confocal
microscopy. Orthogonal view was used to see in 3 dimensions RFP, YFP,
mCFP or nGFP-positive cells and quantify the number of basal and total
cells per clone. Cells were considered as basal when their basal side was
in direct contact with the signal of the second harmonic generation.


Epidermal whole-mount and immunostaining 

Almost the entire back skin was dissected from the animal, the fat was
removed by mechanical scraping with a scalpel and the remaining tissue
was cut in pieces of around 2 cm² and incubated in PBS/EDTA (20 mM)
on a rocking plate at 37 °C for 90 min. Epidermis was separated from
the dermis using forceps as an intact sheet and washed 2 times with
PBS. Pieces of epidermis were pre-fixed in 4% paraformaldehyde for 1 h
at room temperature. Epidermis was rinsed 2 times with PBS for 5 min
and conserved in PBS with 0.2% azide at 4 °C. For immunofluorescence
staining, entire pieces of epidermis were incubated in blocking buffer
(1% BSA, 5% horse serum, 0.8% Triton in PBS) for 3 h at room temperature
on a rocking plate (100 rpm). The samples were incubated in primary
antibody with anti-integrin β4 (rat, 1:200, BD Biosciences) overnight
at 4 °C. Samples were then washed 3 times in PBS with 0.2% Tween for
1 h and incubated in appropriate secondary antibodies diluted 1:400
in blocking buffer for 1 h at room temperature on the rocking plate.
For BrdU staining, samples were incubated in 1 M HCl at 37 °C for 35
min, washed with PBS 0.2% Tween, stained with anti-BrdU (rat, 1:200,
Abcam) in blocking buffer and with appropriate secondary antibody.
The following secondary antibodies were used: anti-rat conjugated
to AlexaFluor488 (Molecular Probes), to Rhodamine Red-X or to Cy5
(Jackson ImmunoResearch). Alexa488 or Alexa633 conjugated
phalloidin (Thermo Fisher Scientific) was used 1:200 in blocking buffer
to visualize F-actin microfilaments. Nuclei were stained in Hoechst
solution diluted 1:5000 for 30 min and samples were mounted in DAKO
mounting medium supplemented with 2.5% Dabco (Sigma). All confocal images
were acquired at room temperature with a LSM780 confocal system
fitted on an AxioExaminer Z1 upright microscope equipped with
C-Apochromat 40×/1.1 or Plan Apochromat 25×/0.8 water immersion
objectives (Zeiss, Jena, Germany). Optical sections 512 × 512 pixels were
collected sequentially for each fluorochrome. The datasets generated
were merged and displayed with ZEN2012 software (Zeiss).


Histology and immunostaining on sections 

Skin epidermis was embedded in OCT and kept at -80 °C. Sections of 
6 μm were cut using a CM3050S Leica cryostat (Leica Microsystems).
After fixation in 4% paraformaldehyde for 10 min at room temperature, 
tissues were washed 3 times in PBS for 5 min and incubated in block- 
ing buffer (1% BSA, 5% horse serum, 0.2% Triton in PBS) for 1h at room 
temperature. Primary antibodies were incubated overnight at 4 °C. 
Sections were rinsed 3 times in PBS and incubated with secondary 
antibodies and Hoechst in blocking buffer for 1h at room tempera- 
ture. Sections were again washed three times with PBS. The following 
primary antibodies were used: anti-Krt14 (chicken, 1:20,000, custom 
batch, Thermo Fisher); anti-Krt1 (rabbit, 1:4,000, Covance/IMTEC, 
PRB-165P); anti-Krt10 (rabbit, 1:4,000, Covance/IMTEC, PRB-159P); 
anti-c-JUN (rabbit, 1:200; Proteintech, 24909-1-AP); anti-p63 (rabbit, 
1:100 Abcam, ab124762); anti-KLF4 (rabbit, 1:100 Abcam, ab129473); 
anti-pSTAT3 (rabbit, 1:100 Cell Signaling, 9145); anti-YAP1 (rabbit, 1:100
Proteintech, 13584-1-AP); anti-MAL (rabbit, 1:100, Sigma, HPA030782);
anti-α-catenin (rabbit, 1:1000 Sigma, C2081); anti-α18-catenin (rat
monoclonal, 1:20,000, a gift from A. Nagafuchi, Kumamoto University);
anti-Vinculin (mouse, 1:200 Millipore MAB3574); anti-p120-catenin
(rabbit, 1:1000 Invitrogen PA5-82545); anti-E-cadherin (rat, 1:500,
eBioscience 14-3249-82); anti-ZO-1 (rabbit, 1:200 Invitrogen 61-7300);
anti-Claudin-1 (rat, 1:100 Thermo Scientific 51-9000); anti-Paxillin 
(rabbit, 1:100 Abcam 32084); anti-CD45 (rat, 1:500, BD Biosciences 
553081); anti-CD68 (rabbit, 1:100 Abcam ab125212). 

The following secondary antibodies were used diluted to 1:400:
anti-mouse, anti-rat, anti-rabbit, anti-chicken conjugated to Alexa
Fluor 488 (Molecular Probes), to rhodamine Red-X (Jackson
ImmunoResearch) or to Cy5 (Jackson ImmunoResearch). Nuclei were stained in
Hoechst solution (1:2000) and slides were mounted in DAKO mounting
medium supplemented with 2.5% Dabco (Sigma). All images of section
immunostaining were acquired using the Axio Imager M1 Microscope,
the AxioCamMR3 or MrC5 camera and using the Axiovision software
(Carl Zeiss). Acquisitions were performed at room temperature using
a 20× objective with numerical aperture (NA) 0.4 (Carl Zeiss).


Immunofluorescence intensity measurements 

To quantify the intensity of the immunostaining for proteins at the
adherens junctions (α-catenin, α18-catenin, β-catenin, p120-catenin)
and Paxillin, ImageJ was used to measure the integrated density, a
well-established method of measuring fluorescence intensity that
accounts for differences in area” of the signal in the basal layer of the
skin (labelled by Krt14). To quantify the integrated density of the signal
of the tight junction proteins ZO-1 and Claudin-1, the measurements
were taken on the upper suprabasal layers. Images are shown with the
pseudo-colour Fire from ImageJ after they have been contrasted equally
and background was uniformly removed with Adobe Photoshop.
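
For readers unfamiliar with the measurement, integrated density is the product of the selected area and the mean grey value within it, which equals the sum of the pixel values in the selection when the area is counted in pixels. The pure-Python sketch below illustrates that definition only; it does not call ImageJ, and the function name, mask representation and toy image are assumptions made for illustration.

```python
def integrated_density(image, mask, pixel_area=1.0):
    """Integrated density = selected area x mean grey value.

    image: 2D list of pixel intensities.
    mask: 2D list of booleans of the same shape, True inside the region
          of interest (for example, the Krt14-labelled basal layer).
    pixel_area: area of one pixel in calibrated units (1.0 = pixel units).
    """
    values = [
        px
        for image_row, mask_row in zip(image, mask)
        for px, inside in zip(image_row, mask_row)
        if inside
    ]
    if not values:
        return 0.0
    area = len(values) * pixel_area
    mean_grey = sum(values) / len(values)
    return area * mean_grey


# Toy 2 x 2 image with three pixels inside the region of interest.
toy_image = [[10, 20], [30, 40]]
toy_mask = [[True, True], [False, True]]
toy_density = integrated_density(toy_image, toy_mask)
```

Because the result grows with both brightness and the size of the stained region, it captures a change in either, whereas the mean intensity alone would miss a change in the extent of the signal.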


Immunohistochemistry 

For staining on paraffin sections, 4-μm paraffin sections were
deparaffinized and rehydrated. Antigen unmasking was performed for 20
min at 98 °C in citrate buffer (pH 6) using the PT module. Endogenous
peroxidase was blocked using 3% H₂O₂ (Merck) in methanol for 10 min
at room temperature. Endogenous avidin and biotin were blocked
using the Endogenous Blocking kit (Invitrogen) for 20 min at room
temperature. Nonspecific antigen blocking was performed using
blocking buffer. Rabbit anti-YAP (1:200, Santa Cruz Biotechnology,
sc-15407), rabbit anti-TAZ (1:100, Sigma-Aldrich, HPA007415), rabbit
anti-cFOS (1:100, Proteintech, 26192-1-AP), rabbit anti-MYH9 (1:500,
Sigma, HPA001644), rabbit anti-pERK (1:200, Cell Signaling, 4370S)
were incubated overnight at 4 °C. For anti-FOSL1 (1:500, Santa Cruz
Biotechnology, sc-28310) and anti-β-catenin (mouse, 1:1,000 Abcam
ab6301) tissues were also blocked with Mouse on Mouse (M.O.M.)
Blocking Reagent (Vector Laboratories). Anti-rabbit biotinylated with
blocking buffer, anti-mouse biotinylated with blocking buffer, Standard
ABC kit, and ImmPACT DAB (Vector Laboratories) were used for the
detection of horseradish peroxidase (HRP) activity. Slides were then
dehydrated and mounted using SafeMount (Labonord).


Proliferation experiments 

For BrdU experiments, mice were injected with a single intraperitoneal
injection of BrdU (50 mg/kg in PBS) at different time points and killed
4 h later. For the quantification, an area of at least 1.5 mm² per mouse
was analysed with Zen2012 (Black Edition) software (Zeiss) to determine
the percentage of BrdU-positive cells.


Trans-epithelial water loss measurement 

TEWL was measured on back skin of isoflurane-sedated mice using
a Tewameter TM210 (Courage and Khazaka). Every measurement
represents the average of 30 individual consecutively recorded
measurements at the probe head.


Transmission electron microscopy 

The skin was cut into pieces of approximately 1 cm² and pre-fixed 2 h
at room temperature in the fixative solution composed of 4%
paraformaldehyde (Sigma-Aldrich) and 2.5% of electron microscopy grade
glutaraldehyde (Sigma-Aldrich) diluted in 0.1 M cacodylate buffer
pH 7.4 (Sigma). Subsequently the pieces were cut again to reach an 
approximate size of 1 mm² and immersed in the fixative solution at
4 °C overnight. After washing 6 times for 10 min in 0.1 M cacodylate
buffer on ice under slow rotation, the samples were post-fixed for 2h 
in 2% osmium tetroxide and again washed 6 times for 10 min in water. 
Samples were then stained with 0.5% uranyl acetate overnight at 4 °C 
in the dark. The following day the samples were washed 6 times for 
10 min in water and incubated with lead aspartate (Walton’s lead
aspartate composed of 0.02 M lead nitrate in 0.03 M sodium aspartate
pH 5.5) for 45 min at 60 °C in the dark. Afterwards, the samples were
washed in water (6 x 10 min) and dehydrated in a graded ethanol series
of 50, 70 and 90% on ice (20 min steps) and 100% ethanol (3 x 20 min)
before being placed in propylene oxide (2 x 20 min). The samples were
infiltrated in a mixture of 2:1 propylene oxide/epoxy resin (Agar) for
1 h at room temperature and in a mixture of 1:1 propylene oxide/epoxy
resin for 1 h at room temperature before being left overnight in 1:2
propylene oxide/epoxy resin mixture with open lid to allow evaporation
of propylene oxide. The next morning, samples were placed in 100% epoxy
resin that was changed every 2 h and embedded at the end of the day in
100% fresh resin. They were incubated overnight at 45 °C and at 60 °C
for 2 days to allow polymerization. Samples were processed into 50-70 nm
ultrathin sections using a Leica Ultracut UCT ultra-microtome.
Micrographs were taken on a JEOL JEM1400 transmission electron
microscope equipped with an Olympus Quemesa 11 Mp camera at 80 kV
accelerating voltage.


Dissociation of epidermal cells, cell sorting and microarray 
analysis 

The mice were shaved with an electric shaver, the back skin was
dissected, and the fat and underlying subcutis were removed with a
scalpel. The samples were placed in 0.25% Trypsin (Gibco, Thermo
Fisher Scientific) in DMEM (Gibco, Thermo Fisher Scientific) and 2 mM
EDTA (Thermo Fisher Scientific) overnight at 4 °C. The epidermis was
separated from the dermis and the trypsin was neutralized by adding
DMEM medium supplemented with 2% Chelex Fetal Calf Serum (FCS).
Samples were filtered on 70 and 40 μm filters (Falcon). Single-cell
suspensions were incubated in 2% FCS/PBS with primary antibodies
for 30 min on ice, protected from the light, with shaking every 10 min.
Primary antibodies were washed with 2% FCS/PBS and cells incubated
for 30 min in APC-conjugated streptavidin (BD Biosciences), on ice,
with shaking every 10 min. Living epidermal cells were gated by forward
scatter, side scatter and negative staining for Hoechst dye. Basal IFE
and infundibulum cells were stained using PE-conjugated
anti-α6-integrin (clone GoH3, 1:200, eBioscience) or FITC-conjugated
anti-α6-integrin (clone GoH3, 1:200, eBioscience) and bulge cells were
stained with biotinylated CD34 (clone RAM34; 1:50, BD Biosciences).
Basal cells from the IFE were targeted using CD34 negative and α6
integrin positive gating. The geometric mean fluorescence intensity of
Itgβ1 (FITC-conjugated anti-β1-integrin, 1:100, BD Biosciences 555005)
and Itgα2 (PE-conjugated anti-α2-integrin, 1:100, BD Biosciences 553858)
was measured on the α6-integrin-high CD34-negative population.

Fluorescence-activated cell sorting analysis was performed using 
FACSAria | at high pressure (70 psi) and FACSDiva software (BD Bio- 
sciences). Sorted cells (300 cells per sample) were harvested directly 
in 45 pl of lysis buffer (20 mM DTT, 10 mM Tris.HCl ph 7.4, 0.5% SDS, 
0.5 pg/pl proteinase K). Samples were then lysed at 65 °C for 15 min and 
frozen. RNA isolation, amplification and microarray were performed 
in the Functional Genomics Core, Barcelona. cDNA synthesis, library 
preparation and amplification were performed as described previously. Microarrays 
were then performed on the Mouse Genome 430 PM strip Affymetrix 
array. The data were normalized using the RMA algorithm. The entire 
procedure was repeated in three technically independent samples for 
the control and the EXP D4 condition and in two technically independent 
samples for the TPA treatment. Genetic signatures were obtained by 
considering genes presenting a fold change greater than 2 or smaller 
than −2, respectively, in both biological replicates. 
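
The selection rule above can be sketched in a few lines. This is a minimal illustration, not the authors' code: it assumes signed fold changes (a value of −2 meaning a twofold decrease), and the function name, gene names and values are hypothetical placeholders.

```python
def signature_genes(fold_changes, threshold=2.0):
    """Split genes into up- and down-regulated signatures.

    fold_changes maps a gene name to its signed fold changes, one per
    replicate (assumed convention: -2 means a twofold decrease).
    A gene is kept only if every replicate passes the threshold.
    """
    up, down = [], []
    for gene, values in fold_changes.items():
        if not values:
            continue  # no replicate data: cannot satisfy "in both replicates"
        if all(v > threshold for v in values):
            up.append(gene)
        elif all(v < -threshold for v in values):
            down.append(gene)
    return sorted(up), sorted(down)


# Made-up example values, for illustration only.
example = {
    "geneA": [3.1, 2.4],    # up in both replicates
    "geneB": [2.5, 1.8],    # fails in one replicate
    "geneC": [-2.2, -4.0],  # down in both replicates
    "geneD": [2.5, -2.5],   # inconsistent direction
}
up_genes, down_genes = signature_genes(example)
```

Requiring every replicate to pass, rather than the average, is what makes the signature conservative: a gene with one strong and one weak replicate is excluded.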


RNA extraction and quantitative PCR 

Fresh-frozen pieces of mouse skin were reduced to powder with a 
mortar and RNA extraction was then carried out with the RNeasy 
Micro Kit from QIAGEN according to the manufacturer’s protocol. 
Purified RNA was used to synthesize the first-strand complementary 
DNA using SuperScript II (Invitrogen) with random hexamers (Roche). 
Quantitative PCR analyses were carried out with Light Cycler 96 (Roche). 
Primers used: TNFα-5’: ACCACGCTCTTCTGTCTACT, TNFα-3’: AGGAGGTTGACTTTCTCCTG, 
IFNγ-5’: TCAAGTGGCATAGATGTGGAA, IFNγ-3’: CACTCGGATGAGCTCATTGA, 
CCL2-5’: CAGGTCCCTGTCATGCTTCT, CCL2-3’: GTCAGCACAGACCTCTCTCT, 
CCL5-5’: ACCATGAAGATCTCTGCAGC, CCL5-3’: TEAACCCACTTCTTCTCTGG. 
Normalizers: HPRT-5’: GCAGTACAGCCCCAAAATGG, HPRT-3’: TCCAACAAAGTCTGGCCTGT, 
Gapdh-5’: CGTGTTCCTACCCCCAATGT, Gapdh-3’: GTGTAGCCCAAGATGCCCTT. 


ATAC sequencing 

For ATAC-seq, 100,000 sorted basal cells were collected in 1 ml PBS supplemented 
with 3% FBS at 4 °C. Cells were centrifuged and cell pellets 
were resuspended in 100 µl lysis buffer (Tris-HCl 10 mM, NaCl 10 mM, 
MgCl2 3 mM, Igepal 0.1%) and centrifuged at 500g for 25 min at 4 °C. 
Supernatant was carefully discarded and nuclei were resuspended 
in 50 µl reaction buffer (Tn5 transposase 2.5 µl, TD buffer 22.5 µl, 
from Nextera DNA sample preparation kit, Illumina, and 25 µl H2O). 
The reaction was performed at 37 °C for 30 min and was stopped by 
adding 5 µl clean-up buffer (NaCl 900 mM, EDTA 300 mM). DNA was 
purified using the MinElute purification kit (QIAGEN) following the 
manufacturer’s protocol. DNA libraries were PCR amplified (Nextera 
DNA Sample Preparation Kit, Illumina), and size selected from 200 
to 800 bp (BluePippin, Sage Science), following the manufacturer’s 
recommendations. 


ATAC-seq analysis 

Adaptor sequences were removed with TrimmomaticPE using options 
‘HEADCROP:10 CROP:70 ILLUMINACLIP:adaptor.file:2:30:10 LEADING:3 
TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:60’. ATAC-seq 
paired-end reads were then aligned to mouse genome GRCm38 
using Bowtie2 (v.2.2.6) using options ‘-X 2000 --fr --very-sensitive 
--no-discordant --no-unal --no-mixed --non-deterministic’. Mitochondrial 
reads, reads from unmapped or random contigs and reads with a 
mapping quality <20 were removed using SAMtools. Duplicate reads 
were removed by Picard tools (http://broadinstitute.github.io/picard/). 
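
The read-filtering criteria can be expressed as a simple predicate. The sketch below is illustrative only and does not reproduce the SAMtools commands used in the study: the contig naming conventions (`chrM`, `chrUn…`, `…_random`) and both function names are assumptions.

```python
# Assumed names for the mitochondrial contig; adjust to the reference used.
MITO_NAMES = {"chrM", "chrMT", "MT", "M"}


def keep_read(chrom, mapq, min_mapq=20):
    """Return True if an aligned read survives the filters described above."""
    if chrom in MITO_NAMES:
        return False  # mitochondrial read
    if chrom.startswith("chrUn") or chrom.endswith("_random"):
        return False  # unplaced or random contig (assumed naming)
    return mapq >= min_mapq  # reads with mapping quality < 20 are removed


def filter_reads(reads, min_mapq=20):
    """reads is an iterable of (chrom, mapq) tuples; returns the kept ones."""
    return [r for r in reads if keep_read(r[0], r[1], min_mapq)]


kept = filter_reads([
    ("chr1", 42),
    ("chrM", 42),
    ("chr4_GL456216_random", 30),
    ("chrUn_JH584304", 30),
    ("chr2", 19),
    ("chr2", 20),
])
```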

Peak calling was performed on each individual sample by MACS2 
(v.2.1.0.20151222) using options ‘-f BAMPE -g mm --nomodel 
--call-summits -B --SPMR’, q-value 0.0001. Peaks from the different 
subpopulations were merged for downstream analysis. 

Read counts of each merged peak for each individual sample 
were calculated by HTSeq-count using options ‘-f bam -r pos -m 
intersection-nonempty’. These counts were normalized for one million 
mapped reads in merged peaks and fold-change was calculated compared 
to control. Peaks were associated to genes with GREAT software 
with the following parameters: 5.0 kb in proximal upstream, 1.0 kb in 
proximal downstream and 100.0 kb in distal. For most of the analysis, 
only peaks annotated to at least one gene were kept. 

Differential peaks are defined as peaks that show at least a twofold 
change compared to control, are called as a peak in the expanded 
condition and contain at least 3 reads per million. 
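
As an illustration of the normalization and the differential-peak definition, the sketch below scales counts to reads per million within merged peaks and applies the three criteria. It is not the authors' pipeline. Two points are assumptions: the twofold change is read as an increase over control, and a peak with zero control signal is treated as passing the fold-change test. All names and numbers are hypothetical.

```python
def reads_per_million(counts):
    """Scale raw per-peak counts to reads per million reads in merged peaks."""
    total = sum(counts.values())
    return {peak: 1e6 * c / total for peak, c in counts.items()}


def differential_peaks(ctrl_counts, exp_counts, called_in_exp,
                       min_fold=2.0, min_rpm=3.0):
    """Peaks called in the expanded sample, with >= min_rpm reads per million
    and at least a min_fold increase over control (assumed direction)."""
    ctrl = reads_per_million(ctrl_counts)
    exp = reads_per_million(exp_counts)
    selected = []
    for peak, rpm in exp.items():
        if peak not in called_in_exp or rpm < min_rpm:
            continue
        ctrl_rpm = ctrl.get(peak, 0.0)
        # Assumption: no control signal counts as passing the fold test.
        if ctrl_rpm == 0 or rpm / ctrl_rpm >= min_fold:
            selected.append(peak)
    return sorted(selected)


# Made-up counts for three merged peaks.
ctrl_counts = {"p1": 100, "p2": 100, "p3": 800}
exp_counts = {"p1": 300, "p2": 100, "p3": 600}
hits = differential_peaks(ctrl_counts, exp_counts, called_in_exp={"p1", "p3"})
```

Normalizing by reads in merged peaks rather than by all mapped reads keeps the comparison insensitive to differences in background signal between libraries.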

De novo motif search was performed using findMotifsGenome.pl 
program in HOMER software searching for motifs of 6 to 12 bp in a 
region of 500 bp around the peak centre. 


Single-cell RNA sequencing 

Single basal cells from the back skin were FACS sorted (Integrin alpha 
6 high, Cd34 negative) after dissociation. 6,000 cells were loaded 
onto each channel of the Chromium Single Cell 3’ microfluidic chips 
(V2-chemistry, PN-120232, 10X Genomics) and barcoded with a 10X 
Chromium controller according to the manufacturer’s recommendations 
(10X Genomics). RNA from the barcoded cells was subsequently 
reverse transcribed, followed by amplification, shearing, 5’ adaptor 
and sample index attachment. The libraries were prepared using the 
Chromium Single Cell 3’ Library Kit (V2-chemistry, PN-120233, 10X 
Genomics) and sequenced on an Illumina HiSeq 4000 (paired-end 
100 bp reads). 


Single-cell transcriptomic data analysis 

Sequencing reads were aligned and annotated with the mm10-1.2.0 ref- 
erence dataset as provided by 10X Genomics and demultiplexed using 
CellRanger (v.2.1.1) with default parameters. Further downstream analyses 
were carried out individually for each of the four samples (CTRL, EXP 
D1, EXP D4 and TPA). Quality control was performed using the scater R 
package (v.1.10.0). For each sample, all retained cells met the following 
criteria: more than 4,000 UMI counts, expression of more 
than 1,500 unique genes and less than 10% of UMI counts belonging to 
mitochondrial sequences. Read counts were normalized using deconvoluted 
cell-specific size factors with the scran R package (v.1.10.2). 
Cell-cycle phases were assigned using the cyclone function from scran. 
Dimensional reduction and clustering were performed using the Seurat 
R package (v.3.1.1). A PCA for each sample was calculated using the 
scaled expression data of the most variable genes (identified as outliers 
on a mean/variability plot, implemented in the FindVariableGenes function 
from Seurat). To remove unwanted sources of variability caused 
by differences in cycling stages while retaining the distinction between 
cycling and non-cycling cells, the differences between S-phase and 
G2/M-phase scores (determined by cyclone) were regressed out during 
scaling (by setting the vars.to.regress parameter of the ScaleData 
function in Seurat). UMAP calculation and graph-based clustering 
were done for each sample using the appropriate functions from Seurat 
(default parameters) with the respective PCA results as input. Different 
values of the resolution parameter were tested for the clustering. We 
set the final resolution to 0.6 (after testing a range from 0.3-1.2) first 
in the control sample, and then applied the same parameters to the 
other samples. Given that the obtained clustering sensitivity for a given 
resolution is dependent on the number of cells of that subpopulation 
in each respective sample, we swept over the same range of resolutions 
for the other samples, to confirm the presence/absence of described 
clusters in all samples. This resolution best reflected the biological 
heterogeneity that emerged from the lineage tracing and the analysis 
of the clonal data, identifying a progenitor population (characterized 
by intermediate levels of Krt5, Krt14, Krt1 and Krt10) that emerged as an 
intermediate cluster between the stem cells (that expressed high levels 
of Krt14, Krt15 and integrins) and the differentiated population (highly 
enriched in Krt1 and Krt10). A Wilcoxon rank-sum test was used to define 
marker genes for each cluster. Benjamini-Hochberg FDR correction was 
applied to potential cluster marker genes across all samples (n=10,651) using 
the p.adjust method in R, and only markers expressed in at least 25% of 
cells of the cluster and having an average log-fold change of at least 0.25 
were reported. Differentially expressed genes and regulons between 
the CTRL SCs and the EXP D1 STRETCH SCs and SCs clusters were computed 
with two-sided Wilcoxon rank-sum tests using the FindMarkers function 
in Seurat, and p-values were FDR adjusted for the total number of 
comparisons (n=542 and n=571, respectively). Pseudotime ordering of 
cells was calculated using slingshot (v.1.1.0). We checked the robustness 
of the resulting trajectories by performing the analysis on PCA 
and UMAP reductions and also on subsets of the data downsampled to 
half the dataset size; these different reductions and permutations did 
not affect the described trajectories. Gene regulatory network analysis 
was performed using SCENIC with default parameters. Differentially 
activated regulons for each cluster were determined by performing a 
Wilcoxon rank-sum test on the normalized regulon AUC values of cells 
in the various clusters; p-values were FDR-adjusted using the Benjamini-Hochberg 
method and regulons with an adjusted p-value less 
than 0.05 were considered differentially activated. For visualization 
purposes we integrated the CTRL, EXP D1, EXP D2 and TPA samples 
using the Seurat package’s standard CCA-MNN based data integration 
workflow. Feature selection was performed using the FindVariableFeatures 
function from Seurat with default parameters, selecting the 2,000 
most variable genes. Canonical correlation analysis (CCA) followed 
by integration anchors selection was then performed on the selected 
features using the FindIntegrationAnchors function from Seurat, taking 
the first 20 dimensions from the CCA into account, as described by the 
Seurat authors. These anchors were then used to integrate the data 
with the IntegrateData function. 
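
Two of the rules described above, the per-cell quality-control thresholds and the Benjamini-Hochberg adjustment over a fixed number of comparisons, are easy to state in code. The sketch below is a plain-Python illustration, not the scater or R code used in the study. The function names are hypothetical, and `bh_adjust` is written to mirror the behaviour of R's `p.adjust(p, method = "BH", n = n)`.

```python
def passes_qc(total_umis, n_genes, mito_umis):
    """QC thresholds from the text: >4,000 UMIs, >1,500 genes, <10% mito."""
    if total_umis <= 4000 or n_genes <= 1500:
        return False
    return mito_umis / total_umis < 0.10


def bh_adjust(pvalues, n=None):
    """Benjamini-Hochberg adjusted p-values.

    n is the total number of comparisons; it defaults to len(pvalues) and
    may be larger when only a subset of the tested p-values is supplied.
    """
    m = len(pvalues)
    if n is None:
        n = m
    if n < m:
        raise ValueError("n must be at least the number of p-values")
    # Walk from the largest p-value down, keeping a running minimum so the
    # adjusted values stay monotone in the original p-values.
    order = sorted(range(m), key=lambda i: pvalues[i], reverse=True)
    adjusted = [0.0] * m
    running_min = 1.0
    for position, index in enumerate(order):
        rank = m - position  # rank 1 is the smallest p-value
        running_min = min(running_min, pvalues[index] * n / rank)
        adjusted[index] = running_min
    return adjusted


adjusted = bh_adjust([0.01, 0.04, 0.03, 0.005])
```

Passing a larger `n`, for example `bh_adjust(p, n=10651)`, corrects for all comparisons made even when only some p-values are listed.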

To compare the single-cell RNA-seq data from the treated EXP D2 
samples (EXP D2-Trametinib and EXP D2-MAL inhibitor), data integration 
was performed with the EXP D1 sample following the approach implemented 
in Seurat. The samples with the treatments were sequenced at 
a different time point than the EXP D1 sample and had in general lower 
UMI count distributions. Therefore, the UMI counts matrix of the EXP D1 
sample was first downsampled, using the downsampleMatrix function 
from the DropletUtils package (https://bioconductor.org/packages/ 
release/bioc/html/DropletUtils.html), such that its total number of 
counts was the same as the average of the total number of counts of 
the treatment samples. QC filtering and pre-processing was then per- 
formed as described before for each individual sample before integra- 
tion. Further downstream analyses such as dimensionality reduction 
and clustering were all performed as described before using the inte- 
grated data. Cell-cycle phase prediction and differential expression 
analysis were performed on the uncorrected data to avoid interfer- 
ing with the underlying distribution assumptions of these statistical 
methods. Cluster annotation was mainly performed by comparing the 
overlap of the new clusters of the integrated data with the original cell 
type annotations of the EXP D1 sample and by studying the expression 
of known marker genes. Differentially expressed genes and regulons 
between the EXP D1 SCs and the EXP D2 SC clusters were computed with 
two-sided Wilcoxon rank-sum tests using the FindMarkers function 
in Seurat, and p-values were FDR adjusted for the total number of comparisons 
(n=13,786 and n=758, respectively). Subpopulations that are 
not associated with the IFE (infundibulum and sebaceous gland) were 
removed for the final analysis. 
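
The idea behind matching sequencing depth before integration can be sketched with binomial thinning: every UMI is kept independently with probability equal to the ratio of the target total to the current total. This is a simplified stand-in and is not the implementation of the downsampleMatrix function in DropletUtils. The function name, seed handling and example matrix are assumptions made for illustration.

```python
import random


def downsample_counts(matrix, target_total, seed=0):
    """Thin a genes x cells count matrix (list of lists of ints) so that its
    expected total equals target_total. Returns a new matrix."""
    rng = random.Random(seed)
    current_total = sum(sum(row) for row in matrix)
    if current_total == 0 or target_total >= current_total:
        return [list(row) for row in matrix]  # nothing to remove
    keep_prob = target_total / current_total
    # Keep each individual count (UMI) with probability keep_prob.
    return [
        [sum(1 for _ in range(count) if rng.random() < keep_prob)
         for count in row]
        for row in matrix
    ]


# Made-up matrix; in the text the target would be the average total count
# of the treatment samples.
original = [[10, 0, 5], [3, 7, 0]]
thinned = downsample_counts(original, target_total=12)
```

Because each count is thinned independently, the realised total varies around the target; a sampling-without-replacement scheme would hit it exactly.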


Statistical analysis 

Two-tailed Student’s t-test, two-tailed Mann-Whitney U-test and Wil- 
coxon signed-rank test were performed using GraphPad Prism v.7.00 
for Mac (GraphPad Software). Bar graphs and dot plots were generated 
by GraphPad Prism and show mean ± s.e.m. unless otherwise indicated. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


Data associated with this study have been deposited in the NCBI Gene 
Expression Omnibus under accession numbers GSE126231, GSE126734 
and GSE146637, respectively, for the microarray, ATAC-seq and 
scRNA-seq. Data supporting the findings of this study are available 
within the Article (and its Supplementary Information files). Source 
data are provided with this paper. 


Code availability 


Custom computer code and algorithms used to generate results that 
are reported in the paper are available within the article (and its Sup- 
plementary Information files) and from the corresponding authors 
on reasonable request. The code used for the modelling of the clonal 
data has been deposited in GitHub (available at https://github.com/ 
BenSimonsLab/Aragona_Nature_2020). 
Extended Data Fig. 1 | A mouse model of mechanical stretch-mediated skin 
expansion. a, Representative photographs of mice with the skin expander 
immediately after surgery at day (D) D0, D2, D4 and in control (CTRL) 
condition. Scale bars, 10 mm. The device was implanted on the back skin of the 
animals, close to the neck where the rigidity of the proximate cervical spines 
allows the hydrogel to stretch the skin during the inflation of the expander. 
Control mice were operated upon similarly but without introducing the 
hydrogel. b, Timeline of the experiment. CD1 mice were operated to place the 
expander and followed over time. c, Scheme showing the growth of the 
hydrogel. The arrows indicate the radius of the hemisphere. d, Hydrogel 
volume (measured by the height of the hydrogel and calculated as the volume 
of a hemisphere, see Methods, n=5 D0, n=13 D0.5, n=13 D1, n=13 D2, n=7 D3, 
n=13 D4, n=10 D6, n=6 D8, n=8 D10, n=5 D14 mice). e-n, TEM of ultrathin 
sections of control (e, g-j) and expanded (f, k-n) epidermis. In e and f, dashed 
yellow lines denote dermal-epidermal boundary and boxed areas in pink, 
cerulean, orange and green are shown at higher magnification, respectively, in 
g and k, h and l, i and m, j and n. Scale bars, 5 µm. g, k, Keratin bundles. 
h, l, Ultrastructural analysis of cell-cell adhesion. i, m, Desmosomes. 
j, n, Hemidesmosomes. o, Quantification of the intercellular spacing on 
images as in h and l. Wilcoxon signed-rank test, two-sided. p, Quantification 
of the width of the desmosomes as in i and m. q, Quantification 
of the number of hemidesmosomes per µm in j and n. r, TEWL measurements 
from n=3 CD1 mice in CTRL and at different time points during expansion. 

s, Immunohistochemistry for the adherens junctions (AJ) component 
β-catenin, n=3 independent experiments. t, v, Representative images of AJ 
component p120-catenin (t) and E-cadherin (v) colour-coded for the signal 
intensity with ImageJ. Protein expression is visualized as a colour gradient 
going from black to yellow, with black as indicator of no expression and yellow 
as indicator of maximal expression. Scale bars, 10 µm. u, w, Quantification of 
the average integrated density signal for p120-catenin (u) and E-cadherin (w). 
Each data point is the average of 3 sections per mouse (n=3 mice per 
condition). o-q, The quantifications are made on n=3 different animals per 
condition on 10 different samples per mouse and represented as mean ± s.e.m. 
g-n, Scale bars, 500 nm. d, p-r, u, w, Two-tailed Mann-Whitney test, 
mean ± s.e.m. 
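
For panel d, the volume estimate follows from the hemisphere formula V = (2/3)πr³. The sketch below assumes that the measured height of the hydrogel is taken as the radius of the hemisphere, which is how the caption reads; the function name and the example height are hypothetical.

```python
import math


def hemisphere_volume(height_mm):
    """Volume (mm^3) of a hemisphere whose radius equals the measured height.

    Assumption: the hydrogel is a perfect hemisphere, so radius == height.
    """
    if height_mm < 0:
        raise ValueError("height must be non-negative")
    return (2.0 / 3.0) * math.pi * height_mm ** 3


# Illustrative height only, not a measured value from the study.
volume = hemisphere_volume(3.0)
```

Because the volume scales with the cube of the height, small errors in the height measurement are amplified threefold in relative terms in the volume estimate.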
Extended Data Fig. 2 | Adhesion remodelling and inflammatory response 
during stretch-mediated skin expansion. a, c, e, Representative images of 
the tight junction (TJ) components ZO-1 (a) and Claudin-1 (c) and of Vinculin (e) 
colour-coded for the signal intensity with ImageJ. Protein expression is 
visualized as a colour gradient going from black to yellow, with black as 
indicator of no expression and yellow as indicator of maximal expression. Scale 
bars, 10 µm. b, d, f, Quantification of the average integrated density signal for 
ZO-1 (b), Claudin-1 (d) and Vinculin (f). The number of mice per condition is 
indicated. g, i, Immunostaining for K14 (red), inflammatory cells stained with 
CD45 (g) and macrophages stained with CD68 (i) (green) and Hoechst for 
nuclei (blue) on tissue sections. Scale bars, 10 µm. White arrows indicate 
positive cells, n=3 independent experiments. h, j, Percentage of CD45 (h) and 
CD68 (j) positive cells on the total dermal cells quantified based on the nuclear 
staining, n=3 mice per condition, mean per mouse ± s.e.m. k, mRNA 
expression analysis for the indicated gene in Untreated (Unt., black) skin and 
skin treated with Dexamethasone (Dexa., grey). Fold change is expressed 
compared to one Unt. sample, n=3 mice per condition, mean per 
mouse ± s.e.m. l, Maximum intensity projection of confocal pictures showing 
immunostaining for K14 (red), BrdU (green) and Hoechst for nuclei (blue) 4 h 
following BrdU administration on whole mount epidermis. Scale bars, 10 µm. 
m, Proportion of basal cells that are BrdU positive (n=3,694 cells counted from 
3 mice for Untreated and n=3,764 cells from 3 mice for the Dexamethasone 
treatment). a, c, e, g, i, Dashed lines indicate the basal lamina. b, d, f, m, 
Two-tailed Mann-Whitney test, mean per mouse ± s.e.m. 
Extended Data Fig. 3 | Clonal analysis of epidermal stem cells during 
homeostasis, TPA treatment and stretch-mediated skin expansion. 
a, Genetic labelling strategy used to trace K14 IFE SC in the back skin during 
homeostasis and stretch-mediated tissue expansion. b, Timeline of the 
experiment. Krt14-creER-Rosa-Confetti mice were induced with Tamoxifen at 2 
months of age and operated upon 3.5 days after to place the expander. The 
samples were collected 0, 1, 2, 4, 8, 10 and 14 days after surgery. c, Raw 
distribution of clone size taken from mouse back skin under normal 
homeostatic conditions (CTRL) at different time points based on basal (top) 
and total (bottom) cell number. Note that times are calibrated so that the “day 
0” time-point is acquired 3.5 days after Tamoxifen injection, requiring effective 
chase times to be calibrated accordingly, see b. D0: 115 clones from n=7 mice; 
D2: 175 clones from n=7 mice; D4: 136 clones from n=5 mice; D8: 159 clones 
from n=3 mice; D10: 146 clones from n=3 mice. d, Timeline of the experiment 
to perform clonal tracing upon TPA treatment. Krt14-creER-Rosa-Confetti mice 
were induced with Tamoxifen at 2 months of age and after 3.5 days topically 
treated with 12-O-Tetradecanoylphorbol-13-acetate (TPA) for 2 consecutive 
days. The samples were collected 1 and 14 days after treatment. e, Maximum 
intensity projection of representative confocal pictures showing 
immunostaining for K14 (red) and BrdU (green) following BrdU administration 
on whole mount epidermis from mice treated with TPA or with vehicle (CTRL). 
Hoechst nuclear staining in blue. Scale bars, 20 µm. f, Percentage of BrdU 
positive cells in control and mice treated with TPA at D1 (n=5). Two-tailed 
Mann-Whitney test, mean ± s.e.m. g, Raw distribution of clone size taken from 
mouse back skin during TPA treatment (TPA) based on basal (top) and total 
(bottom) cell number. D1: 85 clones from n=4 mice; D14: 54 clones from n=5 
mice. h, Raw distribution of clone size taken from mouse back skin under 
stretch-mediated tissue expansion (EXP) at different time points based on 
basal (top) and total (bottom) cell number. As with control, note that times are 
calibrated so that the “day 0” time-point is acquired 3.5 days after Tamoxifen 
injection, requiring effective chase times to be calibrated accordingly. D2: 231 
clones from n=4 mice; D4: 197 clones from n=4 mice; D8: 199 clones from n=4 
mice; D10: 157 clones from n=4 mice. i, Table showing the abundance (raw 
counts) of clones by their basal and suprabasal cell composition from the CTRL 
D2 condition (that is, 5 days post-induction), n=203 clones from 7 mice. j, Table 
showing the abundance (raw counts) of clones by their basal and suprabasal 
cell composition from the EXP D2 condition, n=283 clones from 4 mice. k, Fit of 
the one-progenitor model to the average size of persisting clones in control 
conditions based on the basal (black) and total (blue) cell content. Points show 
data and lines are the results of the fit to a one-compartment model 
(see Methods). l, m, o, Fit to the one-progenitor cell model. Clone persistence 
(l), labelled cell fraction (m), and the distribution of basal (upper) and total 
(lower) clone size (o). Points show data and lines are the results of the fit to a 
one-progenitor model. k-m, o, D0: 115 clones from n=7 mice; D2: 175 clones 
from n=7 mice; D4: 136 clones from n=5 mice; D8: 159 clones from n=3 mice; 
D10: 146 clones from n=3 mice; D14: 195 clones from n=4 mice. n, p, Sensitivity 
analysis of the model fits depicted as a map of the total square-differences of 
the experimental basal/total clone size data and the respective model 
predictions as a function of the average division time, 1/λ, and the degree of 
imbalance towards stem cell loss/replacement, r (see Methods). Panel n shows 
the results of the one-progenitor model and the CTRL data; p shows the results of 
the two-progenitor model and the CTRL data. These results show the 
enhanced accuracy of the two-progenitor model over the one-progenitor 
model, despite involving the same number of fit parameters. k-m, Mean ± s.d. 
o, Mean ± s.e.m. 
Extended Data Fig. 4 | Fit of the data to the two-progenitor model. a, Fit of 
the model to the clone size distribution under homeostatic control conditions. 
Note that, with 1/λ = 4.6 days and r = 0.21, the model faithfully reproduces both 
the exponential-like clone size distribution and the predominance of clones 
bearing an even number of basal and total cell numbers. Mean ± s.e.m. D0: 
n=115 clones from 7 mice; D2: n=175 clones from 7 mice; D4: n=136 clones 
from 5 mice; D8: n=159 clones from 3 mice; D10: n=146 clones from 3 mice; D14: 
n=195 clones from 4 mice. b, Change of division time (1/λ) during stretch-mediated 
expansion as parameterised from the measured rate of BrdU incorporation, 
Fig. 1e. c, Change in the probability of symmetric division (parameter, r) 
during stretch-mediated skin expansion obtained from a fit of the two-compartment 
model to the clone size data (for details, see Supplementary Note). 
d, Corresponding fit of the two-compartment model to the clone size 
distribution during stretch-mediated expansion. The model accurately 
reproduces both the exponential-like clone size distribution and the 
predominance of clones bearing an even number of basal and total cell 
numbers. Notably, the sharp increase in even-sized clones at long times can 
only be recovered by limiting the frequency of renewing divisions well below 
that of the control value. Mean ± s.e.m. D2: n=231 clones from 4 mice; D4: 
n=197 clones from 4 mice; D8: n=199 clones from 4 mice; D10: n=157 clones 
from 4 mice. e, Fit of the model to the clone size distribution at D14 under TPA 
treatment. Note that with 1/λ = 2.3 days and r = 0.15, the model faithfully 
reproduces both the exponential-like clone size distribution and the 
predominance of clones bearing an even number of basal and total cell 
numbers. Mean ± s.e.m. D1: n=85 clones from 4 mice; D14: n=54 clones from 5 
mice. f, g, Sensitivity analysis of the model fits depicted as a map of the total 
square-differences of the experimental basal/total clone size data and the 
respective model predictions as a function of the average division time, 1/λ, 
and the degree of imbalance towards stem cell loss/replacement, r 
(see Methods). Panel (f) shows the results of the two-progenitor 
model and the EXP data, and (g) shows the results of the two-progenitor model and 
the TPA data. For the EXP data (f), we have imposed the measured relative 
variation of the proliferation rate (as inferred from BrdU incorporation) (Fig. 1e 
and panel (b)) and an inferred relative variation of the parameter as obtained 
from a model fit (c), while the two parameters in panel (f) represent variation in 
the net rates. h, Representative orthogonal confocal sections immunostained 
for Krt14 (red), Krt10 (green) following short-term BrdU (white) incorporation 
identifying cells biased for renewal (Krt14+ Krt10−), cells primed for 
differentiation (Krt14+ Krt10+) and differentiated cells (Krt14− Krt10+). 
i, Percentage of the type of divisions in CTRL (108 divisions from n=4 mice) and 
EXP D2 (254 divisions from n=4 mice) based on short-term BrdU tracing and 
staining as in h. Two-tailed Mann-Whitney test, mean ± s.e.m. 
Extended Data Fig. 5 | Genetic signature of TPA-treated and expanded 
epidermis. a, Representative FACS plots showing the strategy used to isolate 
basal cells. Single living cells were gated by debris exclusion (P1), DAPI 
exclusion (P2), doublet elimination (P3) and basal IFE Integrin-α6-high CD34-negative 
cells were sorted (P4). n=10 independent experiments. b, c, mRNA expression 
of genes that were upregulated in basal cells at EXP D4 (n=3) and in cells 
treated with TPA (n=2). These genes are related to a generic stress signature 
(b), regulating ECM remodelling and cytoskeleton, important for cell survival 
and cell cycle (c). Bars are mean with s.e.m. d, Representative images of Paxillin 
immunostaining colour-coded for the signal intensity with ImageJ. Protein 
expression is visualized as a colour gradient going from black to yellow with 
black as indicator of no expression and yellow as indicator of maximal 
expression. Scale bars, 10 µm. e, Quantification of the average integrated 
density signal for Paxillin as in d. Each data point is the average of 3 sections per 
mouse (n=3 mice per condition). f, Geometric mean fluorescence intensity for 
the indicated integrin in CTRL (grey, n=4 mice) and EXP D4 (red, n=6 mice) 
from FACS analysis of basal IFE Integrin-α6-high CD34-negative cells. g-k, ATAC-seq 
profiles showing increasing accessibility of chromatin regions that are 
specifically remodelled during mechanical expansion (CTRL in grey and EXP 
D2 in orange). l, Quantification of the number of FOSL1+ cells in the basal layer 
related to Fig. 3d. m, Immunostaining on skin sections for JUN (white) in control 
and EXP D4. n, Quantification of the number of JUN+ cells in the basal layer 
related to m. o, Quantification of the number of p63+ cells in the +1 layer related 
to Fig. 3e. p, Quantification of the number of KLF4+ cells in the +1 layer related 
to Fig. 3f. q, Immunohistochemistry on paraffin sections for c-FOS in control 
and EXP D4. Scale bars, 20 µm. r, Quantification of the number of c-FOS+ cells 
in the basal layer related to q. s, Immunofluorescence on tissue sections for 
pSTAT3 (green) and K14 (red) to identify the epidermis. Scale bars, 20 µm. 
t, Quantification of the number of cells positive for pSTAT3 in the basal layer 
related to s. l, n, o, p, r, t, 3 sections quantified per mouse; the number of mice (n) 
and total number of cells are indicated in parentheses. d, m, q, s, Dashed lines 
delineate the basal lamina. e, f, l, n-p, r, t, Two-tailed Mann-Whitney test, mean ± s.e.m. 
Extended Data Fig. 6 | Single-cell RNA sequencing clustering analysis. 
a, UMAP graphic representation of the CTRL, EXP D1, EXP D4 and TPA 
single-cell RNA-seq data, showing the graph-based clustering results 
annotated by cell type. The proliferating IFE stem cells (PROLIF. IFE SCs) are 
in light blue, the IFE stem cells cluster are in red (IFE SCs#1) and dark red 
(IFE SCs#2), the IFE committed cells (IFE CCs) cluster is in pink and the 
differentiated IFE cells (IFE DIFF.) are in green. The differentiated cells from the 
infundibulum (INF. DIFF.) are in grey, the stem cells of the infundibulum (INF. 
SCs.) are in black, the proliferating cells of the infundibulum (PROLIF. INF.) are 
in plum and the sebaceous gland cluster (SG) is in orange. The IFE stress cells 
(STRESS) are in dark grey and the cluster of stem cells stretch (SCs STRETCH) in 
yellow. n=16,651 cells. b, UMAP of the different samples (CTRL, EXP D1, EXP D4, 
TPA) using the same integrated projection. n=4,659 cells CTRL, n=4,934 cells 
EXP D1, n=2,716 cells EXP D4, n=4,342 cells TPA. c-k, UMAP plot of the CTRL 
sample coloured by normalized gene expression values for genes identifying the 
IFE (c) versus infundibulum (d), the sebaceous gland (e) and the proliferating 
cells (f). Undifferentiated (g) and more differentiated cells (h) in the IFE 
identified the SCs cluster (i), the CCs cluster (j) and the differentiated stage (k). 
Gene expression is visualized as a colour gradient going from grey to yellow 
with grey as indicator of no expression (that is, expression values below or 
equal to the 50th percentile for that sample) and yellow as indicator of maximal 
expression. c-k, n=16,651 cells. l, Table showing the specific marker genes 
used to annotate the different clusters. 
Extended Data Fig. 7 | Single-cell RNA sequencing clustering analysis on 
the IFE cells. a, Integrated UMAP graphic representation of the IFE cells in 
CTRL, EXP D1, EXP D4 and TPA single-cell RNA-seq data, showing the 
graph-based clustering results annotated by cell type. The proliferating stem 
cells (PROLIF.) are in light blue, the stem cells clusters are in red and dark red 
(SCs#2), the committed cells (CCs) cluster is in pink and the differentiated cells 
(DIFF.) are in green. The stress cells (STRESS) are in dark grey and the cluster of 
stem cells stretch (SCs STRETCH) in yellow. n=12,747 cells. b, UMAP of the 
… assigned using the cyclone function from scran tool and visualized in the UMAP. 
Cells in G1 are in light blue, cells in G2/M are in orange and cells in S phase are in 
red. b-d, n=3,142 cells CTRL, n=3,756 cells EXP D1, n=2,145 cells EXP D4, 
n=3,704 cells TPA. d, Percentage of cells in the different cycling phase 
calculated on the total number of cells. e-h, UMAP plot coloured by normalized 
gene expression values for the indicated gene and in the indicated sample. 
Gene expression is visualized as a colour gradient going from grey to yellow 
with grey as indicator of no expression and yellow as indicator of maximal 
expression. 
Extended Data Fig. 8 | Pseudotime analysis for single-cell RNA sequencing. 
a, UMAP plots coloured by the degree of regulon activation for TFs 
differentially activated (AUC rank-sum test FDR corrected p-value <0.05) in 
the different conditions. Colour scaling represents the normalized AUC value 
of target genes in the regulon being expressed as computed by SCENIC. 
b, Heat map representation of the top 20 gene expression changes along the 
inferred pseudotime trajectory computed with Slingshot for the CTRL IFE. 
c, Heat map representation of the top 20 gene expression changes along the 
inferred pseudotime homeostatic trajectory computed with Slingshot for the 
EXP D1 IFE. d, Heat map representation of the top 20 gene expression changes 
along the inferred pseudotime trajectory computed with Slingshot 
characterizing the stress state for the EXP D1 IFE. b-d, Columns represent cells 
ordered by their position along the pseudotime trajectory; rows represent 
genes whose expression profiles show highest correlation (FDR-corrected 
p-value <0.01) with the pseudotime variable, calculated using a generalized 
additive model (GAM). The colour scaling of the cells represents the 
normalized expression value of a gene in a particular cell, scaled by Z-score. 
a-d, n=3,142 cells CTRL, n=3,756 cells EXP D1. 
Extended Data Fig. 9 | Cell contractility in stretch-mediated tissue 
expansion. a, Scheme of the genetic strategy to delete Diaph3 in the 
epidermis. b, Protocol to delete Diaph3 during stretch-mediated tissue 
expansion. c, Orthogonal views of confocal analysis of immunostaining for K14 
(red) marking basal cells and Phalloidin (green) to visualize F-actin and Hoechst 
for nuclei (blue) in whole mounts of IFE in CTRL from a CD1 mouse, EXP D1 from 
a CD1 or Krt14-cre-DIAPH3fl/fl (Diaph3 cKO) mouse. Scale bars, 10 μm. 
d, Percentage of cells with F-actin fibres in the apical side of basal cells related 
to c (n=4 mice per condition). e, Orthogonal views of confocal analysis of 
immunostaining for K14 (red) marking basal cells, BrdU (green) and Hoechst 
for nuclei (blue) in whole mounts of IFE from Krt14-cre-DIAPH3fl/+ (Diaph3 WT) 
and Krt14-cre-DIAPH3fl/fl (Diaph3 cKO) mice during expansion. Scale bars, 
20 μm. Epidermal Diaph3 cKO were born at a Mendelian ratio and did not 
present obvious pathological phenotypes. n=3 independent experiments. 
f, Immunostaining for the basal marker K14 (red) and the suprabasal markers K1 
and K10 (green) in Diaph3 WT and Diaph3 cKO mice in EXP D2 and EXP D4. Scale 
bars, 20 μm. g, Epidermal thickness of Diaph3 WT and Diaph3 cKO mice in EXP 
D2 and EXP D4 (three measurements taken with ImageJ on two sections per 
mouse, n=at least 3 mice for the different conditions). h, Scheme of the genetic 
strategy to delete Myh9 in the epidermis. i, Protocol to delete Myh9 during 
stretch-mediated tissue expansion. j, Immunohistochemistry for MYH9 in 
untreated and Tamoxifen-induced Krt14-creER-MYH9fl/fl mice. Scale bars, 
20 μm. n=3 independent experiments. k, Orthogonal views of confocal 
analysis of immunostaining for K14 (white), BrdU (red) and Hoechst for nuclei 
(blue) in whole mounts of IFE in Myh9 WT and Myh9 cKO mice during expansion. 
Scale bars, 20 μm. l, Epidermal thickness of Myh9 WT and Myh9 cKO mice in EXP 
D2 and EXP D4 (three measurements taken with ImageJ on two sections per 
mouse, n=at least 3 mice for the different conditions). m–r, Analysis of 
adherens junctions in Diaph3 cKO and Myh9 cKO mice. m, o, q, Representative 
images of adherens junction (AJ) component α-catenin (m), the α18 
tension-sensitive form of α-catenin (α18-catenin) (o) and Vinculin (q), colour-coded for 
the signal intensity with ImageJ. Protein expression is visualized as a colour 
gradient going from black to yellow, with black as indicator of no expression 
and yellow as indicator of maximal expression. Dashed lines indicate the basal 
lamina. Scale bars, 10 μm. n, p, r, Quantification of the average integrated 
density signal for α-catenin (n), α18-catenin (p) and Vinculin (r). Each data point 
is the average of 3 sections per mouse (n=5 mice per condition). 
d, g, l, n, p, r, Two-tailed Mann-Whitney test, mean ± s.e.m. 
Extended Data Fig. 10 | MEK/ERK/AP1, YAP-TAZ and MAL/SRF regulate 
stretch-mediated proliferation. a, b, Protocol for Trametinib or Pimasertib 
treatment in CD1 mice operated to place the expander and sacrificed at D2, 
D4 (a) and D8 (b) after surgery. c, Immunohistochemistry for pERK on 
paraffin sections of epidermis from CD1 mice untreated or treated with the 
indicated drug at EXP D2. d, Quantification of the proportion of BrdU-positive 
cells during expansion at the indicated time point in CD1 mice untreated 
or treated with Trametinib or Pimasertib (n=at least 3 mice per condition 
as indicated, total number of cells analysed indicated in parentheses). 
e, f, Immunohistochemistry for FOSL1 (e) and immunofluorescence for JUN (f) 
on sections of epidermis from CD1 mice untreated or treated with the indicated 
drug at EXP D2. g, Epidermal thickness measured with ImageJ on tissue 
sections at EXP D8 in CD1 mice untreated or treated with the indicated drug 
(n=5 mice untreated, n=4 mice Trametinib, n=3 mice Pimasertib, 3 
measurements on at least 2 sections per mouse). h, Immunostaining (white) for 
YAP1 on skin sections in the control and in EXP D1. White arrows indicate 
nuclear localization. i, Quantification of YAP1 subcellular localization, bars and 
error bars represent the mean and s.e.m. Nuclear (N) > Cytoplasm (C), more 
YAP1 in nucleus than in cytoplasm; N=C, similar level of YAP1 in nucleus and 
cytoplasm; N<C, less YAP1 in nucleus than in cytoplasm (n=150 cells for all 
samples except n=120 for EXP D8). j, Quantification of MAL subcellular 
localization, presented as mean and s.e.m. N>C, more MAL in nucleus than in 
cytoplasm; N=C, similar level of MAL in nucleus and cytoplasm; N<C, less 
MAL in nucleus than in cytoplasm (n=150 cells for all samples except n=120 for 
EXP D8). k, l, Immunostaining (white) for MAL (k) and JUN (l) on skin sections in 
the control and in EXP D1. White arrows indicate nuclear localization. 
m, n, Scheme of the genetic strategy to delete YAP-TAZ in the epidermis (m) and 
protocol to delete YAP and TAZ in stretch-mediated tissue expansion (n). 
o, Immunohistochemistry for YAP (top) and TAZ (bottom) in 
Krt14-creER-YAP-TAZfl/fl mice before and after Tamoxifen administration. p, Orthogonal views 
of confocal analysis of immunostaining for K14 (red) marking basal cells, BrdU 
(green) and Hoechst for nuclei (blue) in whole mounts of IFE in YAP-TAZfl/fl 
(YAP-TAZ WT) or Krt14-creER-YAP-TAZfl/fl (YAP-TAZ cKO) mice at the indicated 
time point following expansion. q, Epidermal thickness of YAP-TAZ WT and 
YAP-TAZ cKO mice in EXP D2 and EXP D4 (three measurements taken with ImageJ on 
two sections per mouse, n=at least 4 mice per condition). r, Protocol to inhibit 
MAL with the CCG203971 small molecule during stretch-mediated tissue 
expansion. s, Quantification of MAL subcellular localization in EXP D2 and EXP 
D4 mice treated or not with the MAL inhibitor. N>C, more MAL in nucleus than 
in cytoplasm; N=C, similar level of MAL in nucleus and cytoplasm; N<C, less 
MAL in nucleus than in cytoplasm (n=150 cells per condition). Data are 
presented as mean and s.e.m. t, Orthogonal views of confocal analysis of 
immunostaining for K14 (red) marking basal cells, BrdU (green) and Hoechst 
for nuclei (blue) in whole mounts of IFE in mice treated with the MAL inhibitor 
or with vehicle control (untreated) at the indicated time point following 
expansion. u, Epidermal thickness of CD1 mice in EXP D2 and EXP D4 treated or 
not with the MAL inhibitor (three measurements taken with ImageJ on two 
sections per mouse, n=3 for untreated mice, n=5 for treated animals). 
c, e, f, h, k, l, o, p, t, Scale bars, 20 μm. n=3 independent experiments. 
c, e, f, h, k, l, o, Dashed lines delineate the basal lamina. d, g, q, u, Two-tailed 
Mann-Whitney test, mean ± s.e.m. 
Extended Data Fig. 11 | Pathways associated with stretch-mediated tissue 
expansion. a, Protocol used to delete YAP and TAZ and to inhibit MAL with 
CCG203971 treatment in Krt14-creER-YAP-TAZfl/fl mice in EXP D2. 
b, Orthogonal views of immunostaining for K14 (red) to mark basal cells, BrdU 
(green) and Hoechst for nuclei (blue) on whole mounts of IFE in YAP-TAZ WT 
untreated mice and YAP-TAZ cKO mice treated with the MAL inhibitor at 2 days 
after the expander placement. Scale bars, 20 μm. c, Proportion of BrdU-positive 
cells in untreated YAP-TAZ WT mice (48,615 cells from 3 mice) and in YAP-TAZ 
cKO mice treated with the MAL inhibitor (78,282 cells from 5 mice). Two-tailed 
Mann-Whitney test, mean ± s.e.m. d, Epidermal thickness of YAP-TAZ WT 
untreated (n=3) and YAP-TAZ cKO treated with the MAL inhibitor in EXP D2 
(n=5), three measurements taken with ImageJ on two sections per mouse. 
Two-tailed Mann-Whitney test, mean ± s.e.m. e, f, Quantification of YAP1 (e) and 
MAL (f) subcellular localization, presented as mean and s.e.m. in CTRL and EXP 
D2. N>C, more protein in nucleus than in cytoplasm; N=C, similar level of 
protein in nucleus and cytoplasm; N<C, less protein in nucleus than in 
cytoplasm (n=150 cells per condition). g, h, Percentage of the type of divisions 
in CTRL (g) and EXP D2 (h) in YAP-TAZ WT mice and YAP-TAZ cKO mice based on 
the short-term BrdU tracing and staining as in Extended Data Fig. 4h. 
i, j, Percentage of the type of divisions in CTRL (i) and EXP D2 (j) in untreated 
mice and with MAL inhibitor based on the short-term BrdU tracing and staining 
as in Extended Data Fig. 4h. k, l, Percentage of the type of divisions in CTRL (k) 
and EXP D2 (l) in untreated mice and with Trametinib based on the short-term 
BrdU tracing and staining as in Extended Data Fig. 4h. g–l, The number of 
counted divisions is indicated in parentheses from n=number of mice. 
Two-tailed Mann-Whitney test, mean ± s.e.m. 
Extended Data Fig. 12 | Single-cell data analysis after MEK and MAL 
inhibition. a, Average size of persisting clones in mice treated with MAL 
inhibitor during expansion, based on the basal (black) and total (blue) cell 
content. Points show data and lines denote the results from the fit to the 
two-compartment model (see main text and Methods). D0: n=115 clones from 
7 mice; D2: n=86 clones from 3 mice; D4: n=83 clones from 3 mice. b, Average 
size of persisting clones in mice treated with Trametinib during expansion, 
based on the basal (black) and total (blue) cell content. Points show data and 
lines denote the results from the fit to the two-compartment model (see main 
text and Methods). D0: n=115 clones from 7 mice; D2: n=84 clones from 3 mice; 
D4: n=80 clones from 4 mice; D8: n=81 clones from 3 mice. c, Fit of the model 
to the clone size distribution during expansion upon MAL inhibition with 
1/λ=3.8 days and r=0.08. D0: n=115 clones from 7 mice; D2: n=86 clones from 
3 mice; D4: n=83 clones from 3 mice. d, Least-square values indicate the 
sensitivity of the fit parameters in c. e, Fit of the model to the clone size 
distribution during expansion upon Trametinib treatment with 1/λ=4.3 days 
and r=0.17. D0: n=115 clones from 7 mice; D2: n=84 clones from 3 mice; D4: 
n=80 clones from 4 mice; D8: n=81 clones from 3 mice. f, Least-square values 
indicate the sensitivity of the fit parameters in e. g, Predicted cell-cycle phases 
assigned using the cyclone function from the scran tool for EXP D1 untreated IFE, 
EXP D2 IFE treated with the MAL inhibitor and EXP D2 IFE treated with 
Trametinib. Cells in G1 are in grey, cells in G2/M are in blue and cells in S phase 
are in red. The percentage of cells in the different cycling phases is calculated 
on the total number of cells. h, Table showing the values of the percentage of 
the different cellular clusters in Fig. 4j, k. a, b, Mean ± s.d. c, e, Mean ± s.e.m. 
Data exclusions No animals were excluded from the analysis. 


Replication All experiments were repeated at least three times (3 biologically independent experiments/mice) showing similar results, except 
for microarray analysis, for which one condition (TPA) was repeated twice (2 biologically independent experiments/mice), and single-cell 
analysis, for which we analysed one mouse per condition (1 biologically independent experiment/mouse). 
All attempts at replication were successful and are shown; n is described in the legends. 


Randomization The experiments were not randomized. The mice included in this study were selected according to their correct genotype. The mice received 
tamoxifen injection(s) when they were 2 months old. Sex-specific differences were minimized by including similar numbers of male and female 
animals. 


Blinding The investigators were not blinded to allocation during experiments and outcome assessment. Blinding was not possible as the same 
investigator processed the animals and analysed the data. 
Antibodies used For immunofluorescence and ISH the following primary antibodies were used: 
anti-β4-integrin (rat, 1:200, BD, clone 346-11A, ref. 553745, lot 5239648), anti-Keratin10 (rabbit, 1:3000, Covance, 
ref. PRB-159P-0100), anti-Keratin1 (rabbit, 1:3000, Covance, ref. PRB-165P-0100), anti-Keratin-14 (rabbit, 1:3000, Thermofisher), 
anti-BrdU (mouse, 1:200, BD, clone 3D4, ref. 560209, lot 4293550), anti-MAL (rabbit, 1:200, Sigma, ref. HPA030782, lot C106712), 
anti-c-JUN (rabbit, 1:200, Proteintech, 24909-1-AP); anti-p63 (rabbit, 1:100, Abcam, ab124762); anti-KLF4 (rabbit, 1:100, Abcam, 
ab129473); anti-pSTAT3 (rabbit, 1:100, Cell Signaling, 9145); anti-YAP1 (rabbit, 1:100, Proteintech, 13584-1-AP); rabbit anti-YAP 
(1:200, Santa Cruz Biotechnology, sc-15407), rabbit anti-TAZ (1:100, Sigma-Aldrich, HPA007415), rabbit anti-cFOS (1:100, 
Proteintech, 26192-1-AP), rabbit anti-MYH9 (1:500, Sigma, HPA001644), rabbit anti-pERK (1:200, Cell Signaling, 4370S), 
anti-FOSL1 (1:500, Santa Cruz Biotechnology, sc-28310), anti-β-catenin (mouse, 1:1000, Abcam, ab6301), anti-α-catenin (rabbit, 
1:1000, Sigma, C2081); anti-α18-catenin (rat monoclonal, 1:20000) was a generous gift from Akira Nagafuchi, Kumamoto 
University; anti-Vinculin (mouse, 1:200, Millipore, MAB3574); anti-p120-catenin (rabbit, 1:1000, Invitrogen, PA5-82545); 
anti-E-cadherin (rat, 1:500, eBioscience, 14-3249-82); anti-ZO-1 (rabbit, 1:200, Invitrogen, 61-7300); anti-Claudin-1 (rat, 1:100, Thermo 
Scientific, 51-9000); anti-Paxillin (rabbit, 1:100, Abcam, 32084); anti-CD45 (rat, 1:500, BD Biosciences, 553081); anti-CD68 (rabbit, 
1:100, Abcam, ab125212); anti-SOX6 (rabbit, 1:400, Sigma, HPA001923); anti-NFIB (rabbit, 1:200, Sigma, HPA003956). 
The following secondary antibodies were used: anti-rabbit, anti-rat, anti-goat, anti-chicken, anti-mouse conjugated to 
AlexaFluor488 (Molecular Probes) and to rhodamine Red-X and Cy5 (Jackson ImmunoResearch). 


Validation As positive control for immunostaining, tissues with known expression of the marker were used on mouse sections. As 
negative control, for immunostaining, staining omitting the primary antibody was performed. 


Animals and other organisms 
Laboratory animals K14CREER and K14CRE transgenic mice were kindly provided by E. Fuchs, Rockefeller University, USA. Rosa-Confetti mice were 
provided by H. Clevers. YAPfl/fl and TAZfl/fl mice were obtained from Georg Halder (KU Leuven), who received them from Randy 
Johnson at the MD Anderson Cancer Center, Houston. Diaph3fl/fl mice were obtained from Fadel Tissir (UCL) and MYH9fl/fl mice 
were a kind gift from Sara Wickstrom (University of Helsinki). Rosa26-mT/mG mice were provided by Isabelle Migeotte 
(Université Libre de Bruxelles). Mice included in this study were of mixed genetic background. 
Ethics oversight Mice colonies were maintained in a certified animal facility in accordance with European guidelines. The experiments were 
approved by the local ethical committee (CEBEA) under protocols #604 and #605. The study is compliant with all relevant ethical 
regulations regarding animal research. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 
Methodology 


Sample preparation The mice were shaved with an electric shaver, the back skin was dissected from the mice, and fat and underlying subcutis were 
removed with a scalpel. The samples were placed in 0.25% Trypsin (Gibco, Thermo Fisher Scientific) in DMEM (Dulbecco's 
modified Eagle's medium; Gibco, Thermo Fisher Scientific) and 2 mM EDTA (Thermo Fisher Scientific) overnight at 4°C. The 
epidermis was separated from the dermis and the trypsin was neutralized by adding DMEM medium supplemented with 2% 
Chelex Fetal Calf Serum (FCS). Samples were filtered through 70 and 40 μm filters (Falcon). Single-cell suspensions were incubated in 
2% FCS/PBS with primary antibodies for 30 min on ice, protected from the light, with shaking every 10 min. Primary antibodies 
were washed with 2% FCS/PBS and cells incubated for 30 min in APC-conjugated streptavidin (BD Biosciences), on ice, with 
shaking every 10 min. Living epidermal cells were gated by forward scatter, side scatter and negative staining for Hoechst dye. 
Basal IFE and infundibulum cells were stained using PE-conjugated anti-α6-integrin (clone GoH3; 1:200, eBioscience) and bulge 
cells were stained with biotinylated CD34 (clone RAM34; 1:50, BD Biosciences). Basal cells from the interfollicular epidermis 
were targeted using CD34-negative and α6-integrin-positive gating. The geometric mean fluorescence intensity of Itgβ1 
(FITC-conjugated anti-β1-integrin, 1:100, BD Bioscience 555005) and Itgα2 (PE-conjugated anti-α2-integrin, 1:100, BD Bioscience 
553858) was measured on the α6-integrin-high/CD34-negative population. 


Instrument FACSAria III (BD Bioscience) 
Software FACSDiva Software 8.0.1 (BD Bioscience) 
Cell population abundance The proportion of the relevant cell populations is depicted in Extended Data Fig. 3a 


Gating strategy Living cells were selected by forward scatter, side scatter, doublet discrimination and by Hoechst dye exclusion. Basal epidermal 
cells were selected based on the expression of Integrin-alpha6-high-PE and Cd34-APC negative. 
Colonization by the microbiota causes a marked stimulation of B cells and induction 
of immunoglobulin, but mammals colonized with many taxa have highly complex and 
individualized immunoglobulin repertoires’. Here we use a simplified model of 
defined transient exposures to different microbial taxa in germ-free mice’ to 
deconstruct how the microbiota shapes the B cell pool and its functional 
responsiveness. We followed the development of the immunoglobulin repertoire in B 
cell populations, as well as single cells by deep sequencing. Microbial exposures at the 
intestinal mucosa generated oligoclonal responses that differed from those of 
germ-free mice, and from the diverse repertoire that was generated after intravenous 
systemic exposure to microbiota. The IgA repertoire—predominantly to cell-surface 
antigens—did not expand after dose escalation, whereas increased systemic exposure 
broadened the IgG repertoire to both microbial cytoplasmic and cell-surface 
antigens. These microbial exposures induced characteristic immunoglobulin 
heavy-chain repertoires in B cells, mainly at memory and plasma cell stages. Whereas 
sequential systemic exposure to different microbial taxa diversified the IgG repertoire 
and facilitated alternative specific responses, sequential mucosal exposure produced 
limited overlapping repertoires and the attrition of initial IgA binding specificities. 
This shows a contrast between a flexible response to systemic exposure with the need 
to avoid fatal sepsis, and a restricted response to mucosal exposure that reflects the 
generic nature of host-microbial mutualism in the mucosa. 


Mammalian immunoglobulins are composed of heavy (H) and light 
(L) chains, each of which is assembled from one of many VH/L, (DH) and 
JH/L gene segments during B cell development. The resulting genetic 
and structural diversity of the immunoglobulin repertoire, and antigen 
recognition possibilities, are further increased by additional 
nucleotide insertion during recombination, later somatic mutation 
or class-switch recombination from IgM and/or IgD to IgG, IgE or IgA. 
The selection and clonal expansion of particular B cells with appropriate 
immunoglobulin specificity endows the system with the ability to 
respond to a huge variety of antigens. Comparisons of colonized and 
germ-free mice show that the number of B cells and immunoglobulin 
levels (especially those of IgG and IgA) are considerably amplified by 
colonization with a microbiota, some members of which penetrate 
mucous membranes to prime systemic secondary lymphoid structures 
even in healthy hosts*>. B cell repertoires have so far been shown to be 
largely unique to each individual'”, although whether this is caused by 
differences in microbiota composition or the sequence of colonization 
in each individual is unknown. 

Using localized time-limited exposures of defined doses of single 
benign microbial taxa in germ-free mice’, here we address how the B cell 
repertoire is shaped by microbiota exposure at mucous membranes or in 
systemic lymphoid tissues, and how interactions between different exposure 
sites or subsequent exposures to different taxa affect the outcome. 


Distinct repertoires depend on exposure site 


We used standardized doses of the live non-replicating Escherichia coli 
strain HA107 to transitorily expose germ-free mice to live microorganisms, 
either confined within the intestine or via the blood stream. After 
exposure, all mice rapidly return to germ-free status, owing to HA107 
auxotrophy for the essential bacterial amino acids D-alanine and 
diaminopimelic acid’. We found distinct antibody repertoires computed on 
the basis of CDR3 heavy-chain amino acid identity’ from entire IgA, IgG 
and IgM repertoires, depending on whether the exposure was mucosal 
or systemic (Fig. 1a–c, Extended Data Fig. 1a). Clonotype overlap was 
significantly greater within each exposure condition compared with 
different conditions (right panels of Fig. 1a–c) and different exposure 
conditions segregated in unsupervised tree clustering (bottom panels 
of Fig. 1a–c). We also found a clear separation according to microbial 
exposure condition, with shared clonotypes computed as sequences 
encoded by the same VH and JH segments and an identical nucleotide 
sequence in the CDR3 region’ (Extended Data Fig. 1b–g). 

Fig. 1 | Antibody repertoires in mucosal and systemic tissues after 
transitory oral or systemic exposure to a commensal microorganism. 
a–g, Germ-free mice (GF) were orally or systemically primed three times every 
other day by intragastric (i.g.) (10° colony-forming units (CFU)) or intravenous 
(i.v.) (10⁸ CFU) E. coli HA107 (a–c, e–g), or C. orbiscindens (d), and compared 
with germ-free control mice. Sequencing of the immunoglobulin repertoire at 
21 d for IgA (a, d), IgG2b (b) and IgM (c) in the ileum, mesenteric lymph nodes 
(MLN), bone marrow (BM) and spleen was performed. In a–d, the left panels 
show multidimensional scaling (MDS) plots based on immunoglobulin 
heavy-chain CDR3 amino acid sequences of the entire repertoire. In a–d, the 
right panels show Tukey plots of clonal overlap between samples of different 
priming conditions or the same priming condition (sample comparisons from 
the same mouse were excluded), shown with arithmetic mean and whiskers 
extending to 1.5× interquartile range. Two-sided Wilcoxon rank-sum tests were 
performed with adjusted P values as shown. The bottom panels in a–d show 
hierarchical clustering. Dendrogram branch lengths show the distance 
between repertoires on the basis of CDR3 amino acid sequence similarity; 
vertical dotted lines designate principal unbiased separations. e, IgM, IgA and 
IgG2b of indicated sorted B cell populations at 21 d. In the hierarchical 
clustering dendrograms from the indicated sorted cell populations, branch 
length shows the distance between CDR3 amino acid repertoires. 
f, g, Single-cell VDJ repertoire sequencing analysis. f, MDS plot of distance 
between repertoires on the basis of combined immunoglobulin heavy- and 
light-chain CDR3 amino acid sequences. Grey shading visually indicates 
germ-free repertoires. g, Hierarchical clustering of single-cell heavy- or 
light-chain immunoglobulin repertoires from mesenteric lymph nodes and 
spleen. Data are representative of nine (a–c), two (d, e) or three (f, g) 
independent experiments. All data points represent organs from individual 
mice. Rev., reversible. 
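
The shared-clonotype definition used for Extended Data Fig. 1b–g (same V and J segments plus an identical CDR3 nucleotide sequence) can be illustrated with a minimal sketch. The record layout, the segment labels and the use of a Jaccard index as the overlap measure are assumptions made for illustration; the text does not specify the overlap statistic.

```python
def clonotype_key(read):
    """A clonotype is identified by V segment, J segment and CDR3 nucleotide sequence."""
    return (read["v"], read["j"], read["cdr3_nt"])


def clonotypes(reads):
    """Collapse a list of reads into its set of distinct clonotypes."""
    return {clonotype_key(read) for read in reads}


def overlap(sample_a, sample_b):
    """Jaccard index of two clonotype sets (assumed metric, for illustration only)."""
    a, b = clonotypes(sample_a), clonotypes(sample_b)
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)


# Hypothetical reads; segment labels and sequences are placeholders.
mouse_1 = [
    {"v": "V1", "j": "J1", "cdr3_nt": "TGTGCAAGA"},
    {"v": "V2", "j": "J1", "cdr3_nt": "TGTGCGAGA"},
]
mouse_2 = [
    {"v": "V1", "j": "J1", "cdr3_nt": "TGTGCAAGA"},  # shared with mouse_1
    {"v": "V1", "j": "J2", "cdr3_nt": "TGTGCAAGA"},  # same CDR3, different J: not shared
]
shared_fraction = overlap(mouse_1, mouse_2)
```

Under this definition the second read of `mouse_2` is a distinct clonotype even though its CDR3 matches, because the J segment differs.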

We used Clostridium orbiscindens (German Collection of Microorganisms 
and Cell Cultures (DSMZ) accession 8061) as a second transitory 
colonizing taxon to confirm that the repertoires depend on the route 
of microbial exposure (Fig. 1d, Extended Data Fig. 1h). Flow cytometric 
sorting before analysis with standardized numbers of subsets of B cells 
showed that the distinct effects of different routes of microbial exposure 
occurred in the memory and plasma cells (Fig. 1e). Although single-cell 
experiments currently only capture a fraction of the clonotype repertoire 
compared with bulk analysis (in our case, 14 ± 6%, x̄ ± s.d., n=8 
independent samples from 6 mice), a multidimensional scaling plot that 
combines the heavy- and light-chain information also segregated 
according to exposure route (Fig. 1f). Subsequent analysis that considered 
single-cell heavy- and light-chain clonotypes individually showed 
that the selective effects of the route of microbial exposure segregate 
with the immunoglobulin heavy chain (Fig. 1g). 

Using bacterial-specific flow cytometry, we verified a preferential 
isotype class switch to IgG2b after systemic exposure, or to IgA after 
mucosal exposure’ (Extended Data Fig. 1i, j). Consistent with previ- 
ous reports® ”, we also found that mucosal exposure could generate a 
small serum IgG response (red symbols in Extended Data Fig. 1k). Con- 
versely, exclusive systemic exposure generated a specific secretory IgA 
response in the intestine (blue symbols in Extended Data Fig. 1l), which 
confirms that both the mucosal and the systemic immune systems can 
be primed by microbial exposure from either route. 

Alternative approaches are being used by different laboratories 
to avoid the overestimation of clonal frequencies owing to sequencing 
errors. We carried out a series of control experiments to confirm 
that equivalent biological clustering results were obtained with either 
computational consolidation or the use of unique molecular identifier 
barcodes (Extended Data Fig. 2a–c). Although each set of bulk sequencing 
results in this Article is from samples processed together in a 
batch, we also verified the technical reproducibility of the same sample 
processed on different occasions (Extended Data Fig. 2d). 


Priming thresholds differ by exposure route 


In our system of non-replicating transitory exposures, in germ-free 
mice, test doses can be calibrated without further expansion in vivo. 
We showed that mucosal doses of 10°-10° colony-forming units were 
needed to reshape the mucosal IgH repertoire (Extended Data Fig. 3a, 
e, f), consistent with the high doses that are known to induce specific 
IgA binding? (Extended Data Fig. 3b). By contrast, the systemic IgH 
repertoire was reshaped by as little as 10,000 colony-forming units 
(Extended Data Fig. 3c, g,h). The doses required for detectable humoral 
responses were generally at least an order of magnitude greater than 
those required for repertoire programming (Extended Data Fig. 3b, d). 


Exposure route affects B cell targeting 


To explore whether repertoires of immunoglobulin heavy-chain clono- 
types differed according to mucosal or systemic microbial exposure, 
we investigated the targets for bacterial-antigen binding according 
to exposure route. Intestinal IgA bound most effectively to the puri- 
fied bacterial membrane fraction of E. coli, whereas serum IgG bound 
to both membrane and cytoplasmic fractions (Fig. 2a). Ribosomal 
proteins of E. coli dominated the cytoplasmic binding of serum IgG 
antibodies, and were confirmed in a second context by proteomic 
analysis in specific-pathogen-free mice that were systemically exposed 
to Enterobacter cloacae (Extended Data Fig. 4a—c). Increased exposure 
of bacterial membranes relative to the cytoplasmic contents in the 
mesenteric lymph nodes after intestinal treatment was shown by higher 
lipopolysaccharide/16S ribosome ratios compared with the spleen 
after systemic treatment (Extended Data Fig. 4d). These differences 
may be caused by the intestinal efficiency of bacterial lipid uptake com- 
pared with bacterial cytoplasmic protein contents, whereas systemic 
exposure would prime equivalent immune responses to membrane 
and cytoplasmic fractions after opsonization and phagocytosis®. 
In support of this, repertoires of B cell clonotypes after systemic (but 
not mucosal) challenge with ultrasound-lysed bacteria overlapped 
with the repertoires after exposure with intact bacteria (Extended Data 
Fig. 4e, f). Conversely, in the absence of local anatomical processing, the 
intrinsic antigen-specific and -nonspecific responses to membrane or 
cytoplasmic bacterial antigens presented directly in vitro did not differ 
depending on mesenteric or splenic origin of the B cells (Extended Data 
Fig. 4g). We conclude that the different repertoires from mucosal and 
systemic B cell priming are a consequence of the way in which microbial 
membrane and cytoplasmic antigens are processed and presented in 
the different anatomical compartments. 

Fig. 2 | Differences between B cell repertoires after systemic or mucosal 
exposure. Germ-free mice were treated orally or systemically with E. coli 
HA107 at the indicated doses, as in Fig. 1. a, Binding of systemic or intestinal 
antibodies (left, serum IgG2b; right, intestinal wash IgA) to E. coli membrane 
versus cytoplasmic fractions evaluated using enzyme-linked immunosorbent 
assay (x̄ ± s.d., n=6 mice, two-sided unpaired t-test). b, Rarefaction plots of 
immunoglobulin heavy-chain sequencing at 21 d for IgA (left) and IgG2b (right) 
in mesenteric lymph nodes and spleen. c, Immunoglobulin heavy-chain 
sequencing at 21 d for IgA in mesenteric lymph nodes (left) and spleen (left 
inset), and IgG2b in spleen (right) and mesenteric lymph nodes (right inset) 
(n=3 mice for each bacterial exposure group and each defined dose, 
n=15 germ-free control mice, *P=0.036 mesenteric lymph node IgA (both 
cases), P=0.046 spleen, P=0.036 mesenteric lymph node IgG2b, two-sided 
Wilcoxon rank-sum test). d, IgM (left), IgG2b (middle) and IgA (right) 
heavy-chain sequencing of splenic sorted B cell populations at 21 d, n=6 mice 
for each condition. Tukey plots indicate the calculated number of clonotypes 
per 1,000 cells in B cell populations. **P=0.003, two-sided Wilcoxon rank-sum 
test. e, f, Single-cell sequencing analysis. Networks built on combined heavy- 
and light-chain CDR3 amino acid sequences each from single spleen 
IgG-expressing (e) or mesenteric lymph node IgA-expressing (f) B cells 
(singlets excluded; n=1,601 and 670 for e and f, respectively). Edges show 
Levenshtein distance = 1 or 2. Networks show immunoglobulin sequence 
relationships within and between different mice in the same experiment, 
colour-coded according to mouse and treatment. Left pie charts indicate the 
percentage of edge connections that are based on Levenshtein distance 1 on 
the light chain compared to the heavy chain; right pie charts indicate the 
distribution of edge connections between individual B cells within or between 
exposure conditions. Corresponding networks for naive B cells are shown in 
Extended Data Fig. 6a, b for e, f, respectively. Tukey plots in c, d are shown with 
arithmetic mean and whiskers of 1.5× interquartile range. Data are 
representative of two independent experiments. All data points are from 
individual mice. 
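
The sequence networks of Fig. 2e, f connect cells whose CDR3 amino acid sequences lie within a Levenshtein distance of 1 or 2. A minimal sketch of that edge rule follows. It treats each cell as a single sequence string, whereas the figure combines heavy- and light-chain CDR3 sequences; the function names and the example sequences are hypothetical.

```python
from itertools import combinations


def levenshtein(a, b):
    """Edit distance (insertions, deletions, substitutions) by dynamic programming."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution or match
        previous = current
    return previous[-1]


def build_edges(sequences, max_distance=2):
    """Return (i, j, distance) for every pair within 1..max_distance edits."""
    edges = []
    for (i, seq_i), (j, seq_j) in combinations(enumerate(sequences), 2):
        distance = levenshtein(seq_i, seq_j)
        if 1 <= distance <= max_distance:
            edges.append((i, j, distance))
    return edges


# Hypothetical CDR3 amino acid sequences, one per cell.
cdr3s = ["CARDYW", "CARDFW", "CARGFW", "CTTSLLW"]
edges = build_edges(cdr3s)
```

Identical sequences (distance 0) are not linked by this rule; whether they are collapsed beforehand is not stated in the legend and is left open here.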


Exposure context shapes IgA or IgG diversity 

IgA has previously been found to be oligoclonal®”*. We next followed 
how repertoire expansion or restriction developed according to isotype, 
exposure route and dose. Whereas IgG diversity generally increased con- 
siderably after systemic exposure, no increase in IgA diversity was found 
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Fig. 3 | Antimicrobial antibody responses after combined mucosal and 
systemic exposure or exposure to two different microbial taxa. 

a, b, Germ-free mice were intestinally (a) or systemically (b) exposed to £. coli 
HA107 onalternate days, or remained germ-free throughout. On day 21, all mice 
were intravenously (a) or intestinally (b) exposed to the indicated doses of 
HA107 (schema shown in Extended Data Fig. 7a, b). Flow cytometric analysis of 
specific bacterial binding of serum IgG2b (a) or intestinal IgA (b) on day 42. 

c, Germ-free mice were primed as ina, b. Immunoglobulin sequencing for 1gM 
of sorted splenic plasma cells. MDS plot of repertoire separation. d-g, 
Germ-free mice were mucosally primed with three doses of £. coli HA107 or 

S. typhimurium HA218, or remained germ free. At day 21, half of each group 
received a second schedule of priming with the opposite taxon. n = 3 for each 
condition (schema shown in Extended Data Fig. 8a). d, e, Bacterial flow 
cytometry of E.-coli-HA107-specific (d) or S.-typhimurium-HA218-specific (e) 


in any tissue after maximal mucosal exposure (Fig. 2b)—and the diversity 
of both IgG and IgA was considerably less than that of IgM (Extended 
Data Fig. 5a). As the dose for mucosal exposure increased, the diversity 
of CDR3 clonotypes in the IgA heavy chain became more restricted, 
particularly in the mesenteric lymph nodes (Fig. 2c). By comparison, 
diversity of the IgG heavy chain in both the spleen and the mesenteric 
lymph nodes was maintained or further increased with higher levels of 
systemic exposure (Fig. 2c). We also confirmed directly in sorted cells 
that repertoire diversity is increased after systemic microbial exposure 
in IgG memory and plasma cells, but not after either route of exposure 
in IgA plasma cells (Fig. 2d). Transitory exposure by both the systemic 
and the mucosal routes generated only small numbers of somatic muta- 
tions, which were not substantially increased by increasing the transitory 
exposure doses (Extended Data Fig. 5b, c). The converse of the breadth 
of the isotype repertoires is the degree of clonal relatedness, which 
was higher for IgA than for IgG2b (Extended Data Fig. 5d, e). Mucosal 
and systemic exposures clearly have different physical constraints in 


intestinal IgA at day 42. f, Comparisons of mean repertoire diversity and 
median percentage of expanded clonotypes within the IgA repertoires in the 
spleen (right) and mesenteric lymph nodes (left). g, Hierarchical clustering of 
full-length IgA heavy-chain sequences. h-k, Experimental design as in d-g, 
except both reversible taxa given systemically. n = 3 for each condition (schema 
shown in Extended Data Fig. 8b). h, i, Bacterial flow cytometry for serum IgG 
bacterial binding. j, Comparisons of mean repertoire diversity and median 
percentage of expanded clonotypes within the IgG2b repertoires in the spleen 
(right) and mesenteric lymph nodes (left). k, Hierarchical clustering of 
full-length IgG2b heavy-chain sequences. f, j, Unpaired two-tailed t-test was 
performed on the number of expanded clonotypes. Bars show ranges for each 
dimension. Data are representative of three (a) or two (b-j) independent 
experiments. All data points are from individual mice. 


terms of the epithelial barrier and scalability of local antigen process- 
ing and presentation to B cells. Nevertheless, our experiments show 
that increased doses of mucosal microorganisms only narrow the IgA 
repertoire, whereas the systemic IgG repertoire is progressively diversi- 
fied as the exposure dose is increased. 

To compare directly how immunoglobulin heavy- and light-chain 
sequences evolve in different mice during microbial exposure at the 
different sites, we analysed single-cell networks of paired heavy and 
light chains in naive and class-switched B cells. As shown in Figs. 1g, 2e, 
f and Extended Data Fig. 6a, b, light-chain diversity made only a minor 
contribution to the networks. IgA in the mesenteric lymph nodes after 
intestinal exposure predominantly showed heavy-chain connectivity 
(including between germ-free and mucosally stimulated mice), whereas 
IgG networks from the spleen of systemically exposed mice had very few 
germ-free sequences (Fig. 2e, f). By contrast, germ-free immunoglobu- 
lin sequences were present in our analysis of naive B cells and interca- 
lated into the networks of both mucosally and systemically exposed 
mice (Extended Data Fig. 6a, b). There was spontaneous activation 
and class-switch recombination to IgA (even in the mesenteric lymph 
nodes of germ-free mice), whereas splenic B cells became significantly 
activated after systemic microbial exposure (Extended Data Fig. 6c-f). 
We conclude that there are clear relationships of class-switched immu- 
noglobulin sequences between mice exposed to different microorgan- 
isms in both the mesenteric lymph nodes and the spleen. In diversely 
colonized mice, polyspecific antibodies have previously been shown 
to arise from specificities in the germ-free repertoire’®. We show here 
that the developing IgA repertoire that is characteristic of mucosal 
exposure is related to the class-switched immunoglobulin sequences 
present in germ-free mice. 


Oral sensitization of systemic responses 


Because mucosal or systemic microbial exposures generate different 
responses in the B cell repertoire at different thresholds, we asked 
whether exposure at one site could determine the characteristics of 
a secondary response at the other. The real-world situation is that 
there are variable degrees of sequential exposure in both mucosal and 
systemic compartments. 

Oral feeding of a soluble protein antigen tolerizes subsequent B and 
T cell responses to systemic exposure with the same antigen”. By con- 
trast, a sequence of microbial mucosal exposure followed by systemic 
exposure reduced the threshold for the systemic response by several 
orders of magnitude compared with intestinal exposure alone (Fig. 3a, 
Extended Data Fig. 7a). This effect is T-cell-dependent, as shown by 
the fact that the enhanced systemic response could be abrogated 
through CD4 T cell depletion at the time of the initial intestinal expo- 
sure (Extended Data Fig. 7c-f). Although systemic responses were 
sensitized by prior mucosal exposure to the same microorganism, this 
did not work the other way around; systemic exposure did not reduce 
the subsequent high threshold for mucosal IgA responses primed by 
the mucosal route (Fig. 3b, Extended Data Fig. 7b). 

Given that individual mucosal and systemic exposures lead to differ- 
ent B cell repertoires, we asked whether the order of sequential expo- 
sure (systemic to mucosal, or vice versa) makes a difference. Our results 
showed that, for splenic IgM plasma cells, the site of first exposure 
determined the resulting repertoire even after subsequent exposure 
at a different site (Fig. 3c), although the priming may be dominated by 
the site of first exposure. 


Functional effects of sequential exposures 


Our final question was how IgA and IgG repertoires build up with suc- 
cessive transitory exposures to different microorganisms. Mucosal 
exposure to the non-replicating Salmonella typhimurium strain HA218 
(engineered from a non-invasive, non-virulent parent strain on the 
same principles as those for E. coli HA107'%) attenuated an established 
mucosal IgA response to E. coli HA107, or vice versa with the opposite 
order (Fig. 3d, e, Extended Data Fig. 8a). By contrast, the broader sys- 
temic IgG response to either microorganism given systemically was 
unrestricted, irrespective of exposure order (Fig. 3h, i, Extended Data 
Fig. 8b). The repertoires of CDR3 clonotypes were also biased by the 
most recent mucosal exposure, whereas after sequential systemic expo- 
sures the clustering was distinct regardless of exposure order (Fig. 3g, k). 
The clonal relatedness and diversity of IgA showed no significant 
increase after successive treatments, by contrast with the increased 
IgG clonal diversity and relatedness in the spleen as successive systemic 
treatments were administered (Fig. 3f, j). These results are consistent 
with a greater flexibility of new systemic IgG specificities to accommo- 
date a response to another microorganism (in order to avoid systemic 
sepsis), whereas IgA mucosal protection requires more generic—and 
probably lower—affinity responses*”° that adapt sequentially to many 
different varieties of overlapping antigen exposure?) ”), 
Here we have shown that the route and order of exposures to micro- 
bial antigens determine the repertoire and functional outcome for B cell 
immune responses. In general, the systemic exposure thresholds are 
lower, but trigger responses that expand and diversify the B cell reper- 
toire (as compared with mucosal exposure). This is consistent with the 
IgA repertoire building on an evolutionarily determined baseline, as 
some of the clonotypes overlap with immunoglobulin sequences from 
defined natural antimicrobial specificities (Extended Data Table 1). 
These different functional demands from B cell immunity offer an 
explanation for the different characteristics of responses of the B cell 
system against challenges from the microbiota in different host com- 
partments, the expansion of the same B cell clone in multiple Peyer’s 
patches” and poor responses to the mucosal route for vaccination, 
where limited hygiene results in an increased burden of environmental 
microorganisms and recurrent mucosal infections”*”*. 
Methods 


No statistical methods were used to predetermine sample size. The 
experiments were not randomized and investigators were not blinded 
to allocation during experiments and outcome assessment. 


Mice 

Germ-free C57BL/6 mice (males or females, 8-16 weeks of age) were 
bred and maintained in flexible-film isolators at the Clean Mouse Facility 
(University of Bern). Sex balance was ensured in all experimental groups 
and germ-free status was routinely monitored by culture-dependent 
and -independent methods. All mouse experiments were performed 
in accordance with Swiss Federal regulations approved by the Com- 
mission for Animal Experimentation of Kanton Bern. Group sample 
sizes of ≥3 mice were determined from preliminary experiments that 
indicated sufficient power to discriminate repertoire effects. Compu- 
tational analysis, but not experimental processing, was blinded to the 
treatment of each animal. 


Bacterial culture 

Escherichia coli HA107 and non-invasive non-virulent S. typhimu- 
rium HA218 (UK-1 Δasd, Δalr, ΔdadX, ΔmetC::tetRA, ΔssaV::camR, 
ΔinvC::aphT) were cultured overnight in LB medium containing 100 
µg/ml meso-DAP and 400 µg/ml D-alanine at 37 °C, shaking at 200 
rpm’. A full description of in vivo characteristics of the reversible Sal- 
monella strain has been published separately’. Enterobacter cloacae 
was cultured in LB medium at 37 °C, 200 rpm. Clostridium orbiscindens 
(DSMZ 8061) was cultured in BHI medium with additional haemin and 
vitamin K at 37 °C in a Whitley MG500 anaerobic incubator gassed with 
10% (v/v) H₂, 10% CO₂ and 80% N₂. Reversible colonisations with C. orbis- 
cindens (DSMZ 8061) were confirmed through culture-dependent and 
-independent methods to return to germ-free status after live exposure. 
To prepare gavage solutions or intravenous injections, bacteria were 
centrifuged for 10 min at 4,000g and washed twice with sterile PBS. 
The required dose was resuspended in 500 µl of sterile PBS and admin- 
istered to germ-free mice by oral gavage or injection into the tail vein. 


Preparation of E. coli HA107 lysates and subfractions 

Overnight culture of E. coli HA107 was collected, washed with PBS and 
resuspended at 10° CFU/ml in PBS. Bacterial cells were disrupted by 
ultrasound using the Bioruptor system for 15 min at 4 °C (Diagenode, 
30-s on/30-s off mode). The lysate was centrifuged at 10,000g for 10 min, 
diluted and sterilized by 0.2-µm filtration before in vivo treatment or 
further fractionation. The lysate was fractionated into membrane and 
cytoplasmic fractions by ultracentrifugation (Optima MAX-TL bench- 
top ultracentrifuge, Beckman) for 10 min, 100,000g. The cytoplasmic 
supernatant was collected and the pellet containing membranes was 
resuspended in PBS. An 18-h ultracentrifugation at 100,000g was per- 
formed on the cytoplasmic fraction to separate the ribosomal fraction 
from the non-ribosomal fraction”. Protein concentrations were quanti- 
fied using the Pierce BCA assay kit (Thermo Fisher Scientific). 


Systemic and intestinal exposure to live reversible E. coli HA107 
For systemic exposure, germ-free mice were aseptically transferred to a 
laminar flow hood. Doses (107-108 CFU, as shown) of E. coli HA107 were 
administered into the tail vein. For intestinal exposure, mice received 
10?-10"° CFU of E. coli HA107 by oral gavage within germ-free isolators’. 
When multiple doses were administered, mice received three doses on 
alternate days. Mice were analysed two weeks after administration of 
the last exposure dose. 


Systemic and intestinal priming of immune responses with 

E. coli HA107 lysate 

The preparation of bacterial lysates was performed as described in 
‘Preparation of E. coli HA107 lysates and subfractions’. Germ-free 


mice were treated with bacterial lysates (dose equivalent to 10° 
intact bacteria) either systemically or orally on alternate days. Mice 
were analysed two weeks after administration of the last priming 
dose. 


Combined intestinal and systemic priming with E. coli HA107 
Germ-free C57BL/6 mice were divided into two groups. In the first sce- 
nario, a group of mice was treated with three doses of 10°° CFU of HA107 
orally on days 0, 2 and 4, and the other group were unprimed controls. 
Both groups of mice (intestinally primed with 3 doses of 10'° CFU of 
E. coli HA107 and control mice) received an intravenous injection of 
10°-10’ CFU (as shown) of E. coli HA107 on day 21. 

In the second, converse scenario, a group of mice was treated 3 times 
with 108 CFU of HA107 systemically on days 0, 2 and 4 and the other 
group were unprimed controls. Both groups of mice subsequently 
received 3 intragastric doses of 10’-10° CFU (as shown) of E. coli HA107 
on days 21, 23 and 25. All mice from both scenarios were analysed on 
day 42. 


Combined priming with E. coli HA107 and S. typhimurium HA218 

For mucosal priming, germ-free mice received 10” CFU of E. coli HA107 
by gavage on days 0, 2 and 4, or were unprimed controls. Subsequently, 
half the mice in both groups received 3 doses of 10° CFU S. typhimurium 
HA218 on days 21, 23 and 25 by gavage, and the other half received no 
further priming. Within the same experiment further groups received 
the opposite treatment sequence of S. typhimurium HA218 to E. coli 
HA107. Systemic priming was carried out using groups of germ-free 
mice with the above schedule in both treatment orders, except that 
3 doses of 10° CFU of E. coli HA107 or S. typhimurium HA218 by intra- 
venous injection were used instead of the intestinal treatments for 
mucosal priming. Recovery of germ-free status was verified at each 
stage. All mice were analysed on day 42. 


Priming with cholera toxin subunit B 

Germ-free mice each received 3 doses of intragastric (15 µg) or intrave- 
nous (15 µg) cholera toxin B (CTB) on alternate days and were analysed 
3 weeks after the last priming dose. For intragastric delivery, CTB was 
dissolved in 0.1 M sodium bicarbonate (pH 9); systemic delivery was 
in PBS. 


Bacterial fluorescence-activated cell sorting 

Bacterial FACS was performed as described previously’, with the 
following refinements. E. coli HA107 was incubated with serum or intes- 
tinal washes with starting dilutions at 1/10 for serum, or undiluted 
for intestinal washes: in each sample the concentrations of different 
immunoglobulin isotypes had been measured separately by ELISA, 
as detailed below. After incubation and washing to remove unbound 
antibodies, the samples were incubated further at 4 °C for 2 h either with 
secondary anti-mouse IgA-FITC (BD Biosciences C10-3) or anti-mouse 
IgG2b-FITC (BD Biosciences R12-3). Bacteria were acquired on a BD 
FACSArray using FSC and SSC parameters in logarithmic mode. The 
FACS data were analysed using FlowJo software (TreeStar), and the levels 
of bacteria-specific antibodies present in the samples were expressed 
as the geometric mean against the absolute immunoglobulin concen- 
tration in the sample. 


ELISA 

Immunoglobulin-specific ELISAs were performed as previously 
described” using goat anti-mouse IgA, IgM, IgG1, IgG2b, IgG2c, IgG3 
coating antibodies from SouthernBiotech. Antibody standards were 
purified mouse myeloma immunoglobulin of the required isotype with 
peroxidase-coupled secondary antibodies as previously detailed. The 
same protocol was used for antigen-specific ELISA, except that ELISA 
plates were first coated overnight with 10 µg/ml of HA107 membrane, 
cytoplasmic, ribosomal or non-ribosomal fractions. 
Immunoblotting 

Bacterial (E. cloacae) cytoplasmic proteins (10 µg/ml) were loaded onto a 
12% SDS-PAGE gel and electrophoresed for 45 min at 90 V; subsequently 
they were either stained directly for proteins or transferred to a nitro- 
cellulose membrane. After blocking with 3% BSA, the membrane was 
incubated with intravenously primed mouse serum diluted at 1:30 in PBS 
for 1 h at room temperature. Peroxidase-conjugated goat anti-mouse 
IgG catalysis of 3,3′-diaminobenzidine was used to detect bound IgG””. 


LPS versus 16S rRNA measurements 

Germ-free mice were treated either systemically with 10° CFU of E. coli 
HA107 or mucosally with 10"° CFU of E. coli HA107. Mice were dissected 
18 h later. Spleen and MLN were collected and disaggregated on baked 
40-µm metal cell strainers using PBS. Samples were then split into two 
portions for either LPS or 16S rRNA estimation. 


LPS measurements. Samples were centrifuged (350g, 5 min) to remove 
intact cells. Supernatants were heated at 70 °C for 30 min and cleared 
by centrifugation at 10,000g. LPS was quantified using the EndoZyme 
II Recombinant Factor C Endotoxin Detection Assay (Hyglos, BioMé- 
rieux no. 890030) according to the manufacturer’s instructions, using 
tenfold dilutions to circumvent interference. After incubation at 37 °C 
for 90 min fluorescence was measured at an excitation of 380 nm and 
an emission of 445 nm (Infinite Pro 200, Tecan). End-point fluorescent 
measurements with blank corrections and standards from 0.005-5 
EU/ml were used. Standard curve generation and regression analysis 
performed in GraphPad Prism were used to estimate LPS concentra- 
tions in samples, with extrapolation of the standard curve based on 
the regression parameters. Samples and standards were measured 
in duplicate. 
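
The standard-curve step can be illustrated with a short Python sketch. It assumes a log-log linear relationship between blank-corrected fluorescence and endotoxin concentration, which is a common choice for end-point assays but is not stated in the source (the regression itself was performed in GraphPad Prism). The function names and all numerical values below are hypothetical.

```python
import math

def fit_loglog(concs, signals):
    """Least-squares fit of log10(signal) = a + b * log10(conc)."""
    xs = [math.log10(c) for c in concs]
    ys = [math.log10(s) for s in signals]
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    intercept = mean_y - slope * mean_x
    return intercept, slope

def estimate_conc(signal, intercept, slope, dilution=10):
    """Invert the fitted curve and correct for the sample dilution factor."""
    return dilution * 10 ** ((math.log10(signal) - intercept) / slope)

# Hypothetical standards (EU/ml) and blank-corrected fluorescence readings.
standards = [0.005, 0.05, 0.5, 5.0]
fluorescence = [20.0, 200.0, 2000.0, 20000.0]

a, b = fit_loglog(standards, fluorescence)
# A tenfold-diluted sample reading 400 fluorescence units.
sample_eu_per_ml = estimate_conc(400.0, a, b, dilution=10)
```

Duplicate wells would be averaged before the fit; readings outside the standard range rely on extrapolation and should be treated with corresponding caution.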


16S measurements. Total RNA was extracted from samples using 
the DNA-RNA all prep kit (Qiagen), and real-time PCR was performed 
on synthesized cDNA using forward (5’-ATG CGT AGA GAT CTG GAG 
G-3’) and reverse primers (5’-CAA CCT CCA AGT CGA CAT C-3’) for 16S 
ribosomal gene amplification. 


Lymphocyte isolation 

To isolate lymphocytes from spleens, lymph nodes and Peyer’s patches, 
tissues were cut into small pieces and digested in IMDM (2% FCS) con- 
taining collagenase type IA (1 mg/ml, Sigma) and DNase I (10 U/ml, 
Roche) at 37 °C for 30 min. Cellular suspensions were passed through 
a cell strainer (40 µm) and washed with IMDM (2% FCS, 2 mM EDTA). 
Erythrocytes in splenic suspensions were lysed using 0.84% NH₄Cl 
for 2 min at room temperature. Cells were finally resuspended in FACS 
buffer and counted in a Neubauer chamber. 


Flow cytometry 

After isolation, cells were washed once with PBS before staining with 
fixable viability dye diluted in PBS (eBioscience) for 30 min on ice. 
Single-cell suspensions were sequentially incubated with primary 
biotin- or fluorescence-coupled antibodies diluted in FACS buffer. The 
following mouse-specific conjugated antibodies were used: B220-FITC 
(RA3-6B2, Biolegend), BCL6-AlexaFluor647 (K112-91, BD Biosciences), 
CD3-BV785 (17A2, Biolegend), CD4-PE (RM4-5, Biolegend), CD11b- 
PEcy7 (M1/70, Biolegend), CD19-AF700 (6D5, Biolegend), CD21-FITC 
(7E9, Biolegend), CD22-FITC (OX-97, Biolegend), CD23-PerCPcy5.5 
(B3B4, Biolegend), CD25-PerCPcy5.5 (PC61, Biolegend), CD43-FITC 
(1B11, Biolegend), CD80-PerCPcy5.5 (16-10A1, Biolegend), CD86-BV421 
(GL-1, Biolegend), CD93-PEcy7 (AA4.1, Biolegend), CD138-PE (281-2, 
Biolegend), CD273-APC (TY25, Biolegend), CXCR5-biotin (2G8, BD 
Biosciences), IgA-FITC (C10-3, BD Biosciences), IgG2b-FITC (R12-3, 
BD Biosciences), IgM-PEcy7 (RMM-1, Biolegend), Ly77-BV421 (GL-7, 
BD Biosciences) and PD-1-PE (RMP1-30, Biolegend). Streptavidin-APC 


(BD Biosciences) was used together with biotin-coupled antibodies. 
Data were acquired on an LSRFortessa (BD Biosciences) and analysed 
using FlowJo software (Tree Star). In all experiments, FSC-H versus 
FSC-A was used to gate singlets, and to exclude dead cells using the 
fluorescence-coupled fixable viability dye (eBioscience). B lineage 
subsets were stained for flow cytometry cell sorting as: pre B cells from 
bone marrow (live/dead−CD3−CD19+CD138−IgM−CD25+), plasma cells 
from bone marrow (live/dead−CD3−CD138+CD19−CD22−), transitional 
B cells from spleen (live/dead−CD3−CD19+CD138−GL7−CD93+), splenic 
IgM+ plasma cells (live/dead−CD3−CD19−CD138+IgM+), splenic IgG2b+ 
plasma cells (live/dead−CD3−CD19−GL7−CD138+IgG2b+), follicular 
splenic B cells (live/dead−CD3−CD19+CD93−CD138−CD23highCD21low), 
splenic marginal zone B cells (live/dead−CD3−CD19+CD93−CD138−CD23lowCD21high), 
splenic IgM+ memory cells (live/dead−CD3−CD19+CD138−CD80−CD273+IgM+)*s, 
splenic IgG2b+ memory cells (live/dead−CD3−CD19+GL7−CD138−IgG2b+), 
peritoneal B2 cells (live/dead−CD3−CD19+CD23+CD43−), 
peritoneal B1a cells (live/dead−CD3−CD19+CD23−CD43+B220lowIgMhigh), 
peritoneal B1b cells (live/dead−CD3−CD19+CD23−CD11b+B220highIgMlow) 
and splenic/MLN IgA+ plasma cells (live/dead−CD3−CD19−GL7−CD138+IgA+). 


In vitro primary cell culture 

Primary cells were isolated from spleen, Peyer’s patches and MLN 
from germ-free or E.-coli-HA107-conditioned mice. Tissues were 
disaggregated through 40-µm cell strainers and flushed with 10 ml 
IMDM + GlutaMax (GIBCO) supplemented with 10% FCS (Invitrogen). 
Cell suspensions were centrifuged (350g, 5 min, 4 °C) and erythrocyte 
lysis from splenic samples was performed as described in ‘Lymphocyte 
isolation’. 

For the estimation of IgA and IgG2b production by B cells, cells 
were centrifuged again and resuspended in complete IMDM medium: 
IMDM + GlutaMAX (GIBCO), supplemented with 10% FCS (Invitrogen), 
10,000 µg/ml streptomycin (GIBCO), 10,000 U/ml penicillin (GIBCO) and 
50 µM 2-mercaptoethanol (Merck). The cell suspensions were seeded 
at a density of 10° cells/ml in 200 µl of complete IMDM medium per 
well, in 96-well round-bottom tissue culture plates. The cell cultures 
were incubated at 37 °C, 5% CO₂. Viability of the cultured cells was 
checked daily by microscopy. Supernatants were collected after five 
days and antibody concentrations were determined using ELISA as 
described in ‘ELISA’. 

For the assessment of repertoire reprogramming to antigen exposure, 
an aliquot of cells was collected and CD86+CD138+ antigen-experienced 
plasma cells were sorted and used as pre-treatment controls. Remaining 
cells were seeded at a density of 10° cells/ml in RPMI 1640 medium sup- 
plemented with FCS, streptomycin, penicillin and β-mercaptoethanol 
as above, with the addition of 10 ng/ml recombinant IL-4 (Peprotech) in 
a 48-well flat-bottomed plate. Escherichia coli cytoplasmic and mem- 
brane fractions were added at 5 µg/ml (protein equivalent) to culture 
wells. The culture conditions were as detailed above. On day 6, cells 
were collected, and CD86+CD138+ plasma cells were FACS-sorted for 
repertoire analysis. 


CD4 T cell depletion 

Germ-free mice were injected intraperitoneally either with 250 µg of 
anti-CD4 antibody (clone GK1.5, Bioexcel) or an isotype control (clone 
LTF-2, Bioexcel) diluted in 250 µl of PBS on day −3 before intestinal expo- 
sure with 3 × 10° CFU E. coli HA107 on days 0, 2 and 4. T cell depletion 
and recovery were confirmed by flow cytometry on blood samples taken 
on days 0 and 21, respectively. All mice received a single intravenous 
dose of 10’ CFU E. coli HA107 on day 21. Mice were analysed on day 42. 


Immunoglobulin repertoire bulk sequencing 

Tissues were removed and snap-frozen in Trizol reagent using liquid 
nitrogen. Thawed tissues were immediately homogenized (Retsch 
bead-beater) in 1 ml of Trizol reagent (Life Technologies). Chloroform 


(200 µl) was added, samples were mixed, and centrifuged (12,000g, 
15 min, 4 °C). The upper phase was collected, and RNA was precipitated 
with ice-cold isopropanol by centrifugation (12,000g, 10 min, 4 °C). The 
RNA pellet was washed with 75% (v/v) ethanol, dried and resuspended in 
RNase-free water. RNA concentrations and purity were analysed using 
a NanoDrop 2000 (Thermo Scientific). 

For IgA, cDNA synthesis was performed by mixing 700 ng of RNA, 
1 µl of 2 µM gene-specific primer mix’, 1 µl of 10 mM of dATP, dCTP, 
dGTP and dTTP (Invitrogen), with dH₂O to a total volume of 13 µl. For 
IgM and IgG2b heavy chains and light chains, oligo(dT) primers were 
used for cDNA synthesis. Samples were then heated to 65 °C for 5 min, 
after which they were put on ice for a minimum of 1 min. Tubes were 
briefly centrifuged to collect contents, and in a second reaction step, 
4 µl 5× first strand buffer (Invitrogen), 1 µl 0.1 M DTT (Invitrogen), 1 µl 
RNaseOUT (Invitrogen) and 1 µl SuperScript III RT enzyme (Invitrogen) 
were added to the reaction, mixed and incubated at 55 °C for 50 min. 
The reaction was terminated by heat inactivation at 70 °C for 15 min 
followed by cooling at 4 °C. Samples were stored at −20 °C until further 
processing. 

Amplicon PCR for IgA, IgM or IgG2b heavy chains and κ or λ light 
chains was performed by mixing 5 µl of 10× PlatinumTaq PCR buffer 
(Qiagen), 1 µl of 10 mM mixed dNTPs, 0.2 µl of PlatinumTaq DNA Poly- 
merase (Qiagen), 1.5 µl of 50 mM MgCl₂, 1 µl of each 100 µM forward 
primer mix and reverse primers (see sequences in Supplementary 
Table E, according to chain and isotype), 5 µl template cDNA, in a total 
volume of 50 µl. The forward primers consisted of a degenerate mix 
of 19 different primers binding into FR1 as previously described?*”°. 
Reverse primers were complementary to the constant region of each 
isotype (see sequences in Supplementary Table E). All primers used in 
the study are detailed in the Supplementary Information. 

PCR products were loaded onto a 1.5% (w/v) agarose gel and purified 
with the QIAquick Gel Extraction kit (Qiagen) according to the manu- 
facturer’s instructions. The purified DNA was tested for quality and 
quantity using a 2100 Bioanalyzer (Agilent) and Qubit (Thermo Fisher). 
A second PCR was performed to attach sample indices and sequencing 
adapters using the Nextera XT Index Kit (Illumina). Subsequent sample 
sizes and quality were tested with a Fragment Analyzer (Advanced 
Analytical), normalized and libraries were pooled for sequencing on 
the MiSeq Illumina sequencer in the paired 250-bp mode, or the 300-bp 
mode for experiments containing UMIs. 


Repertoire sequencing and pre-processing 

Antibody variable heavy-chain (VH) libraries were prepared as pre- 
viously described, and sequenced on the Illumina MiSeq platform 
(2 × 250 cycles, paired-end) with 10% PhiX control library. Mean base-call 
quality of all samples was in the range of Phred score 30. The resulting 
FASTQ files were pre-processed (VDJ alignment, clonotyping) using the 
MiXCR software package v.2.1.12 (clonotype formation by CDR3 amino 
acid region annotated using C57BL/6J germline gene data” (available 
at http://mixcr.milaboratory.com/ and https://github.com/milaboratory/mixcr/)). 

MiXCR considers sequence quality and corrects PCR and sequencing 
errors in the alignment step to the reference V, D and J germline seg- 
ments. The error correction is done by assembling the clonotypes with 
a heuristic multilayer clustering, which can be summarized as follows. 
The algorithm consists of two steps: (a) a k-mer chaining algorithm is 
used; then, to find the optimal reference sequence, a heuristic search 
is performed to maximize a scoring function that takes into account 
variations (deletions, insertions and mutations) in the sequence reads. 
(b) Subsequently, multiple candidate alignments are built and a classi- 
cal alignment score (scoring matrix and penalties for indels) is calcu- 
lated. Reads are filtered on the basis of these values. Then, clonotypes 
are assembled and those presenting errors are further removed by clus- 
tering similar sequences on the basis of a defined similarity threshold 
(clonotyping). Finally, MiXCR rescues low-quality reads in a later step 
by mapping them to previously assembled high-quality clonotypes to 
preserve maximal quantitative information. Singletons (supported 
by only a single read) were excluded from clonotype output”’, as 
previously described”. 

For all analyses, except where nucleotide analysis combined with V 
and J segment information is specified, clones were defined by 100% 
amino acid sequence identity of CDR3 regions. CDR3 regions were 
defined by MiXCR according to the nomenclature of the Immunoge- 
netics database (IMGT)**. 
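
The clone definition above (100% CDR3 amino acid identity, singletons excluded) can be sketched in a few lines of Python. This is only an illustration of the grouping rule, not the MiXCR implementation; the function name and the example sequences are hypothetical.

```python
from collections import Counter

def clonotypes_by_cdr3(cdr3_reads, min_reads=2):
    """Group reads by identical CDR3 amino acid sequence.

    Returns {cdr3: (read_count, frequency)} for clonotypes supported by
    at least `min_reads` reads; frequencies are relative to retained reads.
    """
    counts = Counter(cdr3_reads)
    kept = {seq: n for seq, n in counts.items() if n >= min_reads}
    total = sum(kept.values())
    return {seq: (n, n / total) for seq, n in kept.items()}

# Hypothetical CDR3 amino acid sequences, one entry per read.
reads = ["CARGDYW", "CARGDYW", "CARGDYW", "CTRSGYW", "CTRSGYW", "CAKYFDVW"]
clones = clonotypes_by_cdr3(reads)
# "CAKYFDVW" is a singleton and is therefore excluded.
```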


Clonal analysis and quantification of repertoire convergence 
MiXCR output files were further processed in the VDJtools post-analysis 
framework* (available at https://github.com/mikessh/vdjtools) to 
further analyse clonotype diversity, V gene use and sample clustering 
on the basis of CDR3 abundance. Further filtering was applied to keep 
only in-frame, productive sequences. Basic segment-use statistics were 
calculated, and weighted variable use profiles were hierarchically clus- 
tered and visualized as heat maps. Repertoire overlap was measured 
by calculating the geometric mean of relative overlap frequencies 
between variable segment use profiles or CDR3 amino acid sequence 
use. The relative overlap similarity was represented as hierarchical 
clustering or MDS plots. Heat maps following the top 100 clonotypes 
(ranked by abundance) present in at least 2 samples were generated 
on the basis of an all-versus-all intersection between chosen samples 
with settings = ‘strict’, meaning intersections had to contain the iden- 
tical nucleotide sequence (CDR3nt sequence + V gene + J gene). The 
corresponding CDR3 amino acid sequences are provided in the Sup- 
plementary Tables, numbered respective to the figures in which each 
heat map is found. 
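
As a minimal sketch of the overlap measure, the Python snippet below assumes that the "geometric mean of relative overlap frequencies" between samples a and b is sqrt(f_ab × f_ba), where f_ab is the summed frequency in sample a of the clonotypes it shares with sample b. That reading, the function name and the example repertoires are assumptions for illustration. Keys are (CDR3 nucleotide sequence, V gene, J gene) tuples to mimic the ‘strict’ matching rule.

```python
import math

def overlap_f(freq_a, freq_b):
    """Geometric mean of the relative overlap frequencies of two samples.

    freq_a, freq_b: dicts mapping a clonotype key to its frequency within
    that sample (frequencies sum to 1 per sample).
    """
    shared = freq_a.keys() & freq_b.keys()
    f_ab = sum(freq_a[k] for k in shared)  # shared fraction of sample a
    f_ba = sum(freq_b[k] for k in shared)  # shared fraction of sample b
    return math.sqrt(f_ab * f_ba)

# Hypothetical repertoires keyed by (CDR3nt, V gene, J gene).
sample_a = {("TGTGCA", "V1", "J2"): 0.5, ("TGTACA", "V1", "J4"): 0.3,
            ("TGTGGA", "V5", "J1"): 0.2}
sample_b = {("TGTGCA", "V1", "J2"): 0.25, ("TGTACA", "V1", "J4"): 0.25,
            ("TGTCCA", "V3", "J2"): 0.5}

f_value = overlap_f(sample_a, sample_b)  # sqrt(0.8 * 0.5)
```

A matrix of such pairwise values (or a distance derived from it) is the kind of input that hierarchical clustering or MDS would take.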


Technical replicate sequencing and UMI pre-processing 
Technical reproducibility as previously described?*”! was tested on 
cDNA derived from 9 C57BL/6 mice. cDNA was split into three por- 
tions and library preparation was performed in parallel with different 
Illumina indices according to the manufacturer’s instructions and 
protocol. Replicate samples were sequenced with the same sequenc- 
ing depth. cDNA was amplified using primers with the alternative 
primer design approaches of 15 nucleotide unique molecular identi- 
fiers (UMI_1)”° or with UMI with fixed positions preventing secondary 
structures (UMI_2)” by PCR in a multiplex reaction using primer sets 
as described for the Illumina protocol above. Raw reads were demulti- 
plexed and exported without the sample barcodes or Illumina cluster- 
ing adapters. Pre-processing of the raw demultiplexed reads was carried 
out using pRESTO**. Reads with a mean Phred quality score below 20 
were removed, then those without valid forward (V-region) and reverse 
(isotype specific constant region) primer sequence matches were 
removed (match error rate of 0.2). The forward primer regions were 
masked (with Ns) and the reverse primer regions were deleted from 
the sequence. Reads with identical UMIs were collapsed into a single 
consensus sequence for each UMI. UMI read groups with a nucleotide 
diversity score exceeding 0.1 were discarded. Finally, corrected con- 
sensus sequence read pairs were assembled into full length sequences 
with a maximum allowed error rate of 0.2 and P value threshold of 0.1. 
All unique sequences that were not represented by at least two raw 
reads were removed from further analysis and duplicate full-length 
sequences were discarded. Following pre-processing, sequences were 
further processed using MIXCR as described in ‘Repertoire sequencing
and pre-processing’ with the computational correction of PCR errors 
option turned off (-OcloneClusteringParameters = null). 
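
The quality filter, UMI collapsing and minimum-read requirement described above can be illustrated with a minimal sketch. This is not the pRESTO implementation: the function names, the input layout (UMI, sequence, Phred scores) and the simple majority-vote consensus are assumptions made for illustration, and primer matching, primer masking, the nucleotide diversity filter and paired-end assembly are omitted.

```python
from collections import Counter, defaultdict


def mean_phred(quals):
    """Mean Phred quality score of one read."""
    return sum(quals) / len(quals)


def consensus(seqs):
    """Majority-vote consensus over sequences assumed to be of equal length."""
    return "".join(Counter(column).most_common(1)[0][0] for column in zip(*seqs))


def collapse_by_umi(reads, min_quality=20, min_reads=2):
    """Collapse reads sharing a UMI into one consensus sequence.

    reads: iterable of (umi, sequence, phred_scores) tuples (hypothetical layout).
    Reads with a mean Phred score below min_quality are dropped, and UMI
    groups supported by fewer than min_reads reads are discarded.
    """
    groups = defaultdict(list)
    for umi, seq, quals in reads:
        if mean_phred(quals) >= min_quality:
            groups[umi].append(seq)
    return {
        umi: consensus(seqs)
        for umi, seqs in groups.items()
        if len(seqs) >= min_reads
    }
```

The thresholds of 20 and two reads follow the text; everything else is a simplification.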


Mutation and N-joints 

To resolve junctional and N-joint regions, the V, N1, D, N2 and J subregions
were defined on the sequences exported using MIXCR.
Normalized CDR3 subregion (V, N1, D, N2 and J) lengths (median) of 
clonotypes by B cell subpopulation were calculated. Mutation analysis
was performed on the annotated sequences exported using MIXCR
excluding N-joint regions. Each mutation was reported at the amino
acid level for V and J segments counted per sequence. The median
number of mutations per sequence over a given repertoire is reported. 


Rarefaction plots 

To explore the relationship between sample size and diversity coverage,
rarefaction and extrapolation sampling curves were generated using 
the iNEXT R package”. 
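
As an illustration of what a rarefaction curve measures, the sketch below computes the classical expected number of distinct clonotypes in a random subsample of m sequences drawn without replacement. It does not reproduce iNEXT, which additionally provides extrapolation and confidence intervals; the function name and the input (a list of per-clonotype read counts) are assumptions.

```python
from math import comb


def expected_richness(counts, m):
    """Expected number of distinct clonotypes in a subsample of size m.

    counts: per-clonotype abundances (hypothetical input format).
    Uses the hypergeometric rarefaction formula
    E[S_m] = sum_i (1 - C(n - n_i, m) / C(n, m)), with n = sum(counts).
    """
    n = sum(counts)
    if not 0 <= m <= n:
        raise ValueError("subsample size must lie between 0 and the total count")
    total = comb(n, m)
    return sum(1 - comb(n - c, m) / total for c in counts)
```

Evaluating the function over a range of m values gives the interpolated part of a rarefaction curve.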


Determination of sequence similarity among clones within a 
repertoire 

The Levenshtein distance between all pairwise CDR3 amino acid 
sequence combinations of identical CDR3 amino acids was calculated, 
and each Levenshtein distance was subsequently normalized by the 
sequence length of the respective sequence combination. Levenshtein 
distances were computed using the stringdist package in R*°. 
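
A length-normalized Levenshtein distance can be sketched as follows. The analysis itself used the stringdist package in R; this Python version is only an illustration, and dividing by the length of the longer sequence is an assumption, because the text does not specify the exact normalization.

```python
def levenshtein(a, b):
    """Edit distance counting substitutions, insertions and deletions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion from a
                            curr[j - 1] + 1,      # insertion into a
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def normalized_levenshtein(a, b):
    """Levenshtein distance divided by the longer length (assumed normalization)."""
    longest = max(len(a), len(b))
    return levenshtein(a, b) / longest if longest else 0.0
```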


Network construction and analysis 

The networks were constructed by representing each unique amino 
acid CDR3 sequence (clone) as nodes. Clones that were exactly one 
amino acid substitution, insertion or deletion (Levenshtein distance = 1)
apart were connected by edges as previously described“. An antibody
repertoire network is an undirected graph G = (V, E) described as a set of
nodes (CDR3 vertices, V) together with a set of connections (similarity
edges, E), representing the adjacency matrix A of pairwise Levenshtein
distances (LD) between CDR3 amino acids.

Degrees (number of similar CDR3 sequences to a specific CDR3 
sequence) were calculated for each similarity layer LD1 for each CDR3 
sequence in each sample. CDR3 with zero degrees that were not simi- 
lar to any other CDR3 in the network were excluded to fit degree dis- 
tributions. The contribution of expanded clonotypes within a given 
repertoire consisted of the sum of clonotypes with a degree >1 on a
Levenshtein distance 1 network over the size of the repertoire. The 
networks were visualized employing Cytoscape or the Fruchterman- 
Reingold layout and Kamada-Kawai layout algorithms using the igraph 
R package*. For networks based on single-cell data, concatenated 
heavy- and light-chain amino acid CDR3 sequences from the same 
cell were analysed. 
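
The network definition above can be sketched in a few lines. The study used igraph and Cytoscape; the pure-Python version below is only an illustration, all function names are hypothetical, and taking the repertoire size as the number of unique CDR3 sequences is an assumption.

```python
from itertools import combinations


def is_one_edit_apart(a, b):
    """True if a and b differ by exactly one substitution, insertion or deletion."""
    if a == b or abs(len(a) - len(b)) > 1:
        return False
    if len(a) > len(b):
        a, b = b, a
    i = 0
    while i < len(a) and a[i] == b[i]:
        i += 1
    if len(a) == len(b):
        return a[i + 1:] == b[i + 1:]   # single substitution at position i
    return a[i:] == b[i + 1:]           # single insertion in b at position i


def build_network(cdr3s):
    """Adjacency sets of the Levenshtein distance 1 network over unique CDR3s."""
    nodes = sorted(set(cdr3s))
    adjacency = {node: set() for node in nodes}
    for a, b in combinations(nodes, 2):
        if is_one_edit_apart(a, b):
            adjacency[a].add(b)
            adjacency[b].add(a)
    return adjacency


def degrees(adjacency):
    """Number of similar CDR3 sequences for each CDR3 (node degree)."""
    return {node: len(neighbours) for node, neighbours in adjacency.items()}


def expanded_fraction(adjacency, threshold=1):
    """Fraction of clonotypes whose degree exceeds threshold (>1 as in the text)."""
    if not adjacency:
        return 0.0
    expanded = sum(1 for d in degrees(adjacency).values() if d > threshold)
    return expanded / len(adjacency)
```

The all-pairs comparison is quadratic in the number of clones, which is acceptable for a sketch but would need indexing for full repertoires.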


Single-cell transcriptomics and repertoire sequencing 
Germ-free mice were mucosally exposed to 3 x 10"? CFU E. coli HA107,
systemically exposed to 3 x 10° CFU E. coli HA107 or maintained as
germ-free controls. Spleens and mesenteric lymph nodes were removed 
on day 21 and disaggregated to single-cell suspensions. Cells were 
enriched for CD138+ plasma cells by magnetic cell sorting. Cells were
washed once and resuspended in PBS/0.05% (w/v) BSA. The 10x Genom- 
ics platform was used to perform concurrent B cell V(D)J repertoire 
sequencing and single cell 5’ gene expression analysis. Prepared 
libraries were sequenced on an Illumina NovaSeq 6000 (2 x 150 cycles, 
paired-end mode). 


V(D)J immunoglobulin single-cell sequencing analysis 
Processing raw sequencing data. The CellRanger software version 
3.0.1 was used to process the raw sequencing data produced by the 
10x Chromium direct immunoglobulin enrichment. The 10x Genom- 
ics mouse genome GRCm38.93 3.0.0 release was used as reference for 
the alignments. The ‘filtered contigs’ files were subsequently used for 
downstream analyses. 


V(D)J annotation of BCR contigs. Following pre-processing, sequences 
were further analysed using MIXCR v.2.1.12 as described in ‘Repertoire 
sequencing and pre-processing’ with the computational correction of 
PCR errors option turned off (-OcloneClusteringParameters = null). 
V(D)J alignment and annotation was performed against C57BL/6J
germline gene data. CDR3 regions were defined by MIXCR according to
the nomenclature of the Immunogenetics database (IMGT). Clonotypes 
were defined by 100% amino acid sequence identity of CDR3 regions. 


Filtering and display of relationships between exposure conditions 
in B cell receptor repertoire contigs and cells. As with bulk Repseq 
analyses, only productively rearranged clonotypes were used. Cells 
with one heavy chain contig paired with one light chain contig were 
kept for downstream analysis. Cells remaining were checked to have a
transcriptomic profile consistent with B cells (cells with any one of the
following markers: CD19, CD138 or the isotype-encoding genes with a
positive log-normalized expression level). Given the limited output 
at single-cell level, repertoire overlap was relaxed from 100% to 90% 
amino acid identity incorporating a length correction’. The relative 
overlap was normalized for the resulting repertoire size and similarity 
was represented as hierarchical clustering or MDS plots. 
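
The pairing filter described above (productive contigs only, exactly one heavy and one light chain per cell) can be sketched as follows. The record fields `barcode`, `chain`, `productive` and `cdr3` are hypothetical names chosen for illustration, and applying the productivity filter before counting chains is an assumption about the order of operations.

```python
from collections import defaultdict

HEAVY_CHAINS = {"IGH"}
LIGHT_CHAINS = {"IGK", "IGL"}


def paired_cells(contigs):
    """Return {barcode: (heavy_cdr3, light_cdr3)} for cells with one chain of each type.

    contigs: iterable of dicts with hypothetical keys
    'barcode', 'chain', 'productive' and 'cdr3'.
    """
    by_cell = defaultdict(lambda: {"heavy": [], "light": []})
    for contig in contigs:
        if not contig["productive"]:
            continue  # keep only productively rearranged contigs
        if contig["chain"] in HEAVY_CHAINS:
            by_cell[contig["barcode"]]["heavy"].append(contig["cdr3"])
        elif contig["chain"] in LIGHT_CHAINS:
            by_cell[contig["barcode"]]["light"].append(contig["cdr3"])
    return {
        barcode: (chains["heavy"][0], chains["light"][0])
        for barcode, chains in by_cell.items()
        if len(chains["heavy"]) == 1 and len(chains["light"]) == 1
    }
```

The marker-expression check and the relaxed 90% identity overlap are not covered by this sketch.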


Statistical significance 

Statistical significance was tested using either Student's t-test or the
Wilcoxon rank-sum test followed by a Benjamini-Hochberg correction
for false discovery rate. Results were considered significant for
adjusted P < 0.05.
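
For readers unfamiliar with the Benjamini-Hochberg procedure, the adjustment can be sketched as below. This is a generic illustration of the standard step-up method, not the code used in the study, and the function name is hypothetical.

```python
def benjamini_hochberg(p_values):
    """Benjamini-Hochberg adjusted P values, returned in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted
```

A comparison would then be called significant when its adjusted value falls below 0.05.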


Availability of biological materials 

The bacterial strains for reversible germ-free colonisation can be 
obtained without restriction from A.J.M. and S.H. following a mate- 
rial transfer agreement with the University of Bern. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


The raw files for the datasets generated during this study are available
on the Sequence Read Archive, BioProject accession PRJNA625440. The
pre-processed files (clonotype list and properties) are available on the
GitHub repository https://github.com/Mucosal-Immunology-Bern/ 
Manuscript-microbiota-and-B-cell-repertoire. 


Code availability 


The associated codes for the analysis using R packages can 
be found at https://github.com/Mucosal-Immunology-Bern/ 
Manuscript-microbiota-and-B-cell-repertoire. 
Extended Data Fig. 1 | Mucosal and systemic exposure differentially shape
the repertoires of the various immunoglobulin isotypes. a-h, Germ-free
mice were either orally or systemically primed three times every other day by
intragastric (10"° CFU) or intravenous (10° CFU) delivery of E. coli HA107 (a-g)
or Clostridium orbiscindens (h) and compared with germ-free control mice.
Immunoglobulin heavy-chain-repertoire clonotypes at day 21 are defined by V
and J segment use in combination with CDR3 nucleotide sequences, except for
a, in which clonotypes are based on CDR3 amino acid sequences as in Fig. 1.
a, MDS plot of IgA, IgG2b and IgM showing distinct isotype repertoires (in each
case all isotypes were sequenced from each individual sample from at least 3
mice). b-d, MDS plot from ileum, MLN, bone marrow (BM) or spleen (SPL) of
immunoglobulin repertoire sequencing for IgA (b), IgG2b (c) and IgM (d).
e-h, Heat maps showing the 100 most abundant non-unique clonotypes for IgA
(e, h), IgG2b (f) and IgM (g). Clonotype specifics for each panel are shown in
the Supplementary Information. Samples of ileum, BM, MLN and SPL from each
mouse are included. Individual mice are colour-coded on the x-axis of heat
maps. i, j, Immunoglobulins from intestinal wash of intestinally exposed mice
(i) or from serum of systemically exposed mice (j) on day 21 were assessed in
bacterial flow cytometry for specific IgM, IgG1, IgG2b, IgG2c, IgG3 or IgA
binding to E. coli HA107. k, l, E. coli HA107 was incubated with day 21 serum or
intestinal wash immunoglobulin of the three groups of mice followed by
detection of bound mouse IgG2b (k) or IgA (l) by flow cytometry. All data
points are from organs of individual mice.
Extended Data Fig. 2 | Comparison of computational correction method
with UMIs for PCR artefacts and sequencing reproducibility on different
occasions. Germ-free mice were either orally or systemically primed three
times every other day by intragastric (10'° CFU) or intravenous (10° CFU)
exposure to E. coli HA107 and compared with germ-free control mice.
Immunoglobulin repertoire sequencing at 21 days for IgA in MLN and IgG2b in
spleen was performed in parallel from the same RNA samples with two different
primers containing different UMIs or primers without UMIs. a, b, MDS plot of
MLN IgA (a) or spleen IgG2b (b) repertoires showing Euclidean distance
between points representing the similarity between results whether
computational correction (filled symbols) or UMI correction (open and cross
symbols for the different UMIs) was used. c, Heat map based on CDR3 sequence
identity reflecting similarity between UMI-corrected and computational-corrected
IgG2b repertoires. d, Identity matrix based on CDR3 sequences of
technical replicates of the same biological sample that had been used on two
separate occasions for the entire pipeline of cDNA preparation, IgA amplicon
PCR, library preparation and MiSeq sequencing. The distinctions between
replicate mice and technical repeats are shown in the diagram.
Extended Data Fig. 3 | Threshold differences for shaping systemic or
mucosal B cell repertoires and induction of antibody responses. Reversible
E. coli HA107 was given to germ-free mice at the indicated doses orally (a, b, e, f)
or systemically (c, d, g, h), as in the legend to Fig. 1. a, c, Immunoglobulin
repertoire analysis at 21 days showing heat maps of the top 100 non-unique
clonotypes for IgA and IgG2b in MLN and spleen based on V, J segment use
combined with CDR3 nucleotide sequences. b, d, Flow cytometric analysis of
E. coli-specific immunoglobulin binding from intestinal or serum samples
from the corresponding mice in a and c, respectively. e-h, Hierarchical
clustering of the indicated immunoglobulin repertoires. Dendrogram of
samples, branch length shows the distance between repertoires based on
CDR3 amino acid sequence similarity of entire B cell repertoires. The dotted
lines indicate the principal separations in unsupervised analyses as an
independent assessment of thresholds of exposure required for repertoire
shaping. Each column within the heat maps or each dilution series from
antibody-binding bacterial flow cytometry are from individual mice; in every
case, three mice were used for every experimental condition studied.
The Supplementary Information contains a table specifying the top 100
clonotypes in each case for a, c. Data are representative of two independent
experiments.
Extended Data Fig. 4 | Differences in processing of microbial antigens, and
presentation in the mucosal and systemic compartments. a, Reversible E.
coli HA107 were given to germ-free mice at the indicated doses orally or
systemically, as in the legend to Fig. 1. Binding of systemic or intestinal
antibodies to E. coli non-ribosomal versus ribosomal proteins of the
cytoplasmic fraction evaluated using ELISA (x̄ ± s.d., n = 6, two-sided unpaired
t-test). b, c, Enterobacter cloacae proteins were either separated by FPLC or
subject to ribosomal protein purification, before 12% SDS-polyacrylamide gel
electrophoresis. Silver stain for total protein (b). Western blot for
immunoreactivity against serum IgG raised from specific pathogen-free
C57BL/6 mice injected with 10° CFU E. cloacae 20 days previously (c). Control
experiments verified previously published data” that un-manipulated control
mice showed no immunoreactivity against this dominant intestinal aerobe.
Proteomic analysis of extracted bacterial ribosomal proteins confirmed
identities as follows: 50SL24, 50SL11, 50SL4, 50SL3, 50SL1, 30SS4 and 30SS1.
d, Germ-free mice were orally (n = 6) or systemically (n = 5) exposed once by
intragastric (10° CFU) or intravenous (10° CFU) delivery of E. coli HA107.
Mesenteric lymph nodes and spleens were analysed 18 h later for LPS levels and
16S rRNA (x̄ ± s.d., two-sided unpaired t-test). e, f, Indicated doses of intact or
ultrasound-lysed E. coli HA107 were administered orally (e) or systemically (f)
three times to germ-free mice every other day and compared to germ-free
control mice. Immunoglobulin heavy-chain repertoire analysis at 21 days for
IgA in mesenteric lymph nodes (e) or IgG2b in the spleen (f). g, In vitro culture
of leukocytes from the mesenteric lymph nodes or spleen of germ-free mice
stimulated with either cytoplasmic or membrane fractions of E. coli HA107.
Activated B cells were sorted on day 5 after stimulation and IgM heavy chain
sequencing was carried out. e-g, In all cases, Euclidean distance in MDS plots
reflects the distance between indicated repertoires based on CDR3 amino acid
sequences of the entire B cell repertoire. a-d, Data are representative of two
independent experiments. e-g, Data are from two single experiments. Each
data point is from the organ of an individual mouse.
Extended Data Fig. 5 | Network formation of different isotypes depending
on transitory microbial treatment, and comparison with strong cholera
toxin immunogen. Germ-free mice were orally or systemically primed three
times every other day by a range of intragastric (107-10 CFU) or intravenous
(107-108 CFU) doses of E. coli HA107, compared with priming by cholera toxin or
to germ-free control mice. a, Rarefaction plots of immunoglobulin repertoire
sequencing at 21 days for IgM or IgG2b (inset shows same data as in Fig. 2b, but
different y-axis scale) in MLN and spleen. Colour coding indicates the route of
exposure. b, c, Median number of mutations per clonotype for IgA in the MLN
(b) or IgG2b in splenic B cells (c). Tukey plots in each case are shown with
whiskers at 1.5x interquartile range (n = 3 for each condition). d, e, Germ-free
mice were orally or systemically primed three times every other day by
intragastric (10° CFU) or intravenous (10° CFU) delivery of E. coli HA107, or by
intragastric (15 pg) or intravenous (15 pg) delivery of cholera toxin B (CTB) and
compared with germ-free control mice. Immunoglobulin repertoire
sequencing was carried out for IgA, IgM and IgG2b at 21 days. d, Tukey plots
with whiskers at 1.5x interquartile range indicate proportions of expanded
clonotypes (excluding singletons) within the entire CDR3 amino acid IgA
repertoire in ileum, MLN and spleen, or within the IgG2b repertoire in MLN and
spleen after the indicated exposures (n = 3 mice in each bacterial or cholera
toxin priming group, n = 12 germ-free control mice, two-sided Wilcoxon rank-sum
test). Adjusted P values as shown. e, Radial plot showing median values for
mutational levels in MLN, ileum and spleen in individual mice. Peripheral
displays of representative network structures of clonotypes showing
relatedness with edges representing Levenshtein distance 1 (blue). Singletons
are shown in orange. d, e, All conditions were repeated over 10 times, except for
cholera toxin intravenous priming, which was carried out once, and cholera
toxin intestinal priming, which was carried out twice. All data points are from
individual mice.


[Figure panels: a, Spleen, naive IgD+ B cells; b, MLN, naive IgD+ B cells (networks colour-coded by mouse for the systemic exposure, mucosal exposure and germ-free groups; pie charts show heavy/light chain contribution to network connections and clonal relationship between conditions); c, Spleen; d, Spleen culture supernatant (GF versus systemic exposure).]

Extended Data Fig. 6 | Characteristics of naive B cell repertoires following
systemic or mucosal exposure and sites of B cell activation.
a, b, Characteristics of naive single-cell B cell repertoires corresponding to
class-switched single-cell repertoires shown in Fig. 2e, f, respectively.
Germ-free mice were orally or systemically primed three times every other day
by intragastric (10!° CFU) or intravenous (10° CFU) delivery of E. coli HA107 and
compared with germ-free control mice. Single-cell VDJ sequencing analysis on
day 21. Network built on combined heavy- and light-chain CDR3 amino acid
sequences each from a single splenic (a) or MLN (b) naive IgD-expressing B cell,
excluding singlets (n = 4,272 or 5,672 for a and b, respectively). Edges show
Levenshtein distance 1 or 2. Networks show the immunoglobulin sequence
relationships within and between different mice in the same experiment,
colour-coded according to mouse and treatment. Left pie charts indicate the
percentage of edge connections that are based on a Levenshtein distance of 1
on the light chain compared to the heavy chain; right pie charts indicate the
distribution of edge connections between individual B cells within or between
exposure conditions. c, d, Germ-free mice were systemically (i.v.) primed with
two doses of 108 CFU E. coli HA107, 7 days apart, and compared with germ-free
control mice. Lymphocytes from spleen were isolated 18 h after the
intravenous injection, stained with fluorescent-labelled antibodies and
analysed by flow cytometry. c, Overall quantifications (x̄ ± s.d., n = 11 germ-free
and 9 systemic exposure) of live GL7+CD19+ lymphocytes as a proportion of all
CD19+ lymphocytes from two independent experiments. d, Lymphocytes from
spleen were isolated 3 d after the last injection for culture. IgA and IgG2b
antibody levels were determined in culture supernatants at 5 d by ELISA. Plots
show pooled data from two experiments (geometric mean, n = 7 germ-free and
8 systemic exposure for both isotypes). e, f, Germ-free mice were reversibly
exposed with 3 gavage doses of 10° CFU E. coli HA107 and compared with
germ-free control mice. e, Overall quantifications (x̄ ± s.d., n = 4 for each
condition) of live GL7+BCL6+CD19+ lymphocytes as a proportion of all CD19+
lymphocytes from MLN 24 h after the last gavage (x̄ ± s.d.). f, Lymphocytes
from MLN were isolated 3 d after the last injection for culture. IgA and IgG2b
antibody levels were determined at 5 d in culture supernatants by ELISA.
Geometric means, n = 4 for each condition, except IgG2b intestinal
exposure = 3. c-f, P values with unpaired t-test are indicated as shown in the
figure. All data points are from individual mice.
Extended Data Fig. 7 | CD4 T cells are required for systemic immune
memory following intestinal exposure to reversible E. coli HA107.
a, b, Schematic experimental designs to Figs. 3a-c. a, Germ-free mice were
intestinally exposed to 10" CFU reversible E. coli HA107 on alternate days or
remained germ-free throughout. On day 21, all mice were reversibly
intravenously exposed to 10°-10’ CFU HA107, as shown. Mice were analysed on
day 42. b, As in a, except that the initial reversible exposure was given
systemically with subsequent reversible intestinal exposures at different
doses. c, Germ-free mice were mucosally exposed to 3 doses of 10° CFU E. coli
HA107 on days 0, 2 and 4, or left germ-free. In both groups, half the mice were
treated with anti-CD4 depleting antibody intraperitoneally on day -3; the other
half received a control isotype. CD4+ T cells were absent (<0.1% of blood
leukocytes) from day 0 until at least day 10, but were shown to have recovered
by day 21. On day 21, mice were intravenously primed with 10’ CFU E. coli HA107.
Control groups included intestinally only exposed mice and untreated germ-free
mice. d, Representative dot plots of flow cytometric analysis of the blood
on days 0 and 21 in both the isotype and anti-CD4 antibody treated groups.
e, Bacterial flow cytometry at day 42 analysing specific bacterial surface
binding against E. coli HA107 from serum IgG2b of the indicated groups.
f, Immunoglobulin repertoire sequencing for IgG2b in the spleen on day 42.
Euclidean distance in the MDS plot reflects the distance based on CDR3 amino
acid sequences of the entire repertoires. Data in c-f are representative of two
independent experiments. Each dilution series (e) or data points (f) are from
individual mice.
Extended Data Fig. 8 | Experimental schemes. This figure relates to Fig. 3.
a, Schematic experimental design, related to Fig. 3d-g. Germ-free mice were
mucosally primed with three doses of 10 CFU E. coli HA107 or reversible S.
typhimurium HA218 on alternate days or remained germ-free. At day 21, half of
each group received a second schedule of priming, but with the opposite
taxon. Recovery of germ-free status was verified after each stage. b, Schematic
experimental design, related to Fig. 3h-k. As in a, except that both reversible
taxa were given systemically at doses of 10° CFU, as shown.
Extended Data Table 1 | Comparison of B cell receptor CDR3 sequences assessed in this study with previously reported data
CARGFDYW 1229 ANTI-ALPHA 1,6 DEXTRAN HYBRIDOMA
CARLNYGSSYFDYW 1512 ANTI-ALPHA 1,6 DEXTRAN HYBRIDOMA 


CTGLGFDYW Polyreactive High Microbiota Reactive antigen 
CARDYYGSSYYFDYW Non-Polyreactive Low Microbiota Reactive antigen Bunker et al. 7° 


CARYYDYFDYW 1213 Germ-line IgM Abysist+Bunker et al. *° 
Statistics


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 
For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection Flow cytometry data were acquired using BD FACSDIVA (v6.0) or FCAP ARRAY (v1.0.2) software. ELISA plates were read using the Tecan i-control
software. MiSeq control software (v2.6.2.1) was used to control sequence instrument operation; Real Time Analysis software (v.
1.18.54) to perform image analysis, base calling and quality assignment; MiSeq Reporter software (v. 2.6.2) for demultiplexing and FASTQ
generation; and Sequencing analysis viewer (v. 1.10.2) for quality control. Sequencing data were downloaded using LIMS (http:// 
vetsrv03.campus.unibe.ch/). 


Data analysis Data analysis was carried out with FlowJo v9 and 10, MiXCR v2.1.12, VDJtools v1.2.1, CellRanger v3.0.1, Prism v7, Microsoft Excel 2018 
and R v3.5.0. The code for analysis can be found in the github repository (https://github.com/Mucosal-Immunology-Bern/Manuscript- 
microbiota-and-B-cell-repertoire). 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability 


The raw files for the datasets generated during this study will be available on the SRA repository (bioproject PRJNA625440). The pre-processed files (clonotype list 
and properties) will be deposited on the github repository (https://github.com/Mucosal-Immunology-Bern/Manuscript-microbiota-and-B-cell-repertoire). The
associated codes for the analysis using R packages can be found in the same github repository. The list of figures that have associated raw data are given in the
supplementary information table, specifically Figure 1a-g, Figure 2 b-f, Figure 3 c,f,g,j,k, Ext. data 1 a-h, Ext. data 2a-d, Ext. data 3 a,c,e-h, Ext.data 4e-g, Ext.data 5 a-
Life sciences study design


All studies must disclose on these points even when the disclosure is negative. 


Sample size In vivo experiments using germ-free or reversibly colonized germ-free mice were performed using 3-5 mice per group which was the 
maximum number of mice that could be handled simultaneously under strict axenic conditions. We determined that these sample sizes were 
sufficient to discriminate repertoire shaping according to the site and dose of transitory microbial exposure in preliminary experiments. Given 
that many repeats of the repertoire experiments (as specified in the paper) yielded consistent results, we concluded that these sample sizes 
were sufficiently powered for the study. 
Data exclusions No data points were excluded from the analysis.


Replication Each experiment was repeated at least once, but usually more than twice. The exceptions were only for the experiment shown in panels e-g 
of Ext data figure 4, and the cholera toxin i.v. treatment in Ext.data figure 5d. Other treatments in Ext. data Figure 5d were all repeated 2 or 
more times. All replication attempts throughout the paper were successful. 


Randomization No randomization of mice. 1. The nature of germ-free work is to eliminate all possible confounding covariates through use of inbred animals 
and strict standardisation of animal husbandry conditions in aseptic isolators maintained under strict dark-light cycles, sterilised food and 
water. 2. Since the experimental manipulations are to transitorily alter hygiene conditions, we cannot have different treatments in the same 
cage. Therefore, we needed to keep the different hygiene conditions (e.g. transitory mucosal colonisation and germ-free throughout) 
separated. Given the litter sizes of offspring separated at weaning, full randomisation would therefore necessitate large numbers of changes 
of cage mates (as opposed to subdividing a cage with offspring of one gender of a single weaned litter) at the onset of experiments, which we 
needed to avoid to minimise animal stress. 


Blinding Blinding was present during the bioinformatic analysis. Blinding during data collection was not relevant to our study because all biological 
replicates were subjected to identical experimental conditions. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 

n/a | Involved in the study n/a | Involved in the study 
Antibodies ChIP-seq 
Eukaryotic cell lines Flow cytometry 
Palaeontology MRI-based neuroimaging 


Animals and other organisms 


Human research participants 


Clinical data 


Antibodies 


Antibodies used FACS staining antibodies: 
B220-FITC (Biolegend; cat no. 103206; clone no. RA3-6B2; lot no. B208579, B141300; dilution: 1 in 100); Bcl-6-AlexaFluor647 (BD
Biosciences; clone no. K112-91; lot no. 7055649; dilution: 1 in 100); CD3-BV785 (Biolegend; cat no. 100232; clone no. 17A2; lot
no. B23796, B279058; dilution: 1 in 100); CD4-PE (Biolegend; cat no. 100512; clone no. RM4-5; lot no. B259958; dilution: 1
in 100); CD11b-PEcy7 (Biolegend; cat no. 101216; clone no. M1/70; lot no. B213161, B259668; dilution: 1 in 100); CD19-AF700
(Biolegend; cat no. 115528; clone no. 6D5; lot no. B261756, B249811; dilution: 1 in 100); CD21-FITC (Biolegend; cat no.
123408; clone no. 7E9; lot no. B226773; dilution: 1 in 100); CD22-FITC (Biolegend; cat no. 126106; clone no. OX-97; lot no. ;
dilution: 1 in 100); CD23-PerCPcy5.5 (Biolegend; cat no. 101618; clone no. B3B4; lot no. B206794; dilution: 1 in 100); CD25-PerCP-cy5.5
(Biolegend; cat no. 102030; clone no. PC61; lot no. B145989; dilution: 1 in 100); CD43-FITC (Biolegend; cat no.
121206; clone no. 1B11; lot no. B121194; dilution: 1 in 100); CD80-PerCPcy5.5 (Biolegend; cat no. 104722; clone no. 16-10A1; lot
no. B250658, B127414; dilution: 1 in 100); CD86-BV421 (Biolegend; cat no. 105032; clone no. GL-1; lot no. B187225; dilution: 1
in 100); CD93-PE-cy7 (Biolegend; cat no. 136506; clone no. AA4.1; lot no. B236661; dilution: 1 in 100); CD138-PE (Biolegend; cat
no. 142504; clone no. 281-2; lot no. B203320, B182142; dilution: 1 in 100); CD273-APC (Biolegend; cat no. 107210; clone no.
TY25; lot no. B268265; dilution: 1 in 100); CXCR5-biotin (BD Biosciences; clone no. 2G8; lot no. 8080503; dilution: 1 in 50);
IgA-FITC (BD Biosciences; cat no. 559354; clone no. C10-3; lot no. 5089919, 7128897; dilution: 1 in 100); IgG2b-FITC (BD
Biosciences; cat no. 553395; clone no. R12-3; lot no. 5099651; dilution: 1 in 100); IgM-PEcy7 (Biolegend; cat no. 406514; clone
no. RMM-1; lot no. B172064; dilution: 1 in 100); Ly-77-BV421 (BD Biosciences; cat no. 562967; clone no. GL-7; lot no. B275586;
dilution: 1 in 100); PD-1-PE (Biolegend; cat no. 109104; clone no. RMP1-30; lot no. B142661; dilution: 1 in 100); Streptavidin-APC
(BD Biosciences; cat no. 554063; dilution: 1 in 300)

ELISA antibody: 

Coating antibody: Goat anti-mouse IgM (Southern Biotech; cat no. 1020-01); Goat anti-mouse IgG1 (Southern Biotech;
cat no. 1070-01); Goat anti-mouse IgG2b (Southern Biotech; cat no. 1090-01); Goat anti-mouse IgG3 (Southern Biotech; cat no.
1100-01); Goat anti-mouse IgA (Southern Biotech; cat no. 1040-01). Standards: Purified mouse myeloma IgM (Zymed; cat no.
02-6800); Purified mouse myeloma IgG1 (Zymed; cat no. 02-6100); Purified mouse myeloma IgG2b (Zymed; cat no. 02-6300);
Purified mouse myeloma IgG3 (PharMingen; cat no. 553486); Purified mouse IgA (PharMingen; cat no. 553476). Secondary
antibodies: Anti-mouse IgM μ chain specific-peroxidase conjugated (Sigma; A8786); Anti-mouse IgG γ chain specific-peroxidase
conjugated (Sigma; A3673); Anti-mouse IgA α chain specific-peroxidase conjugated (Sigma; A4789).

Anti-CD4 depleting antibody (BioXCell; cat no. BE0003-1; lot no. 62831701)


Validation Antibodies have been validated using serial titration of the antibody in combination with either an isotype or a fluorescence minus
one control.
Animals and other organisms


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals Mice used in this study were housed in the Clean Mouse Facility of the University of Bern. All mice were born and raised under sterile conditions
and kept in sterile flexible film isolators. C57BL/6 mice were used (8-16 weeks of age, both sexes).


Wild animals The study did not involve wild animals. 
Field-collected samples The study did not involve samples collected from field studies. 
Ethics oversight All mouse experiments were performed in accordance with Swiss Federal and Cantonal regulations. Permission was granted by
the Commission for animal experimentation of the Kanton Bern.


Note that full information on the approval of the study protocol must also be provided in the manuscript. 


Flow Cytometry 


Plots 


Confirm that: 


The axis labels state the marker and fluorochrome used (e.g. CD4-FITC). 


The axis scales are clearly visible. Include numbers along axes only for bottom left plot of group (a 'group' is an analysis of identical markers). 


All plots are contour plots with outliers or pseudocolor plots. 


A numerical value for number of cells or percentage (with statistics) is provided. 


Methodology 
Sample preparation The single cell suspensions were collected by smashing mesenteric lymph nodes, spleen or bone from freshly sacrificed mice. 
Instrument BD FACS Array, Aria III, Fortessa and CytoFlex.
Software BD FACSDiva (v6.0) for collection and FlowJo (versions 9 and 10) for analysis. 


Cell population abundance Purity of each sorted sample was over 90% as confirmed by re-acquisition on the sorter (Aria III).


Gating strategy Relevant gating strategies were described in Methods. We have not provided a separate figure given the standard settings, but 
could do so. 


Tick this box to confirm that a figure exemplifying the gating strategy is provided in the Supplementary Information. 
In pathophysiology, reactive oxygen species oxidize biomolecules that contribute to
disease phenotypes. One such modification, 8-oxoguanine (o8G), is abundant in
RNA but its epitranscriptional role has not been investigated for microRNAs
(miRNAs). Here we specifically sequence oxidized miRNAs in a rat model of the
redox-associated condition cardiac hypertrophy. We find that position-specific o8G
modifications are generated in seed regions (positions 2-8) of selective miRNAs,
and function to regulate other mRNAs through o8G•A base pairing. o8G is induced
predominantly at position 7 of miR-1 (7o8G-miR-1) by treatment with an adrenergic
agonist. Introducing 7o8G-miR-1 or 7U-miR-1 (in which G at position 7 is substituted
with U) alone is sufficient to cause cardiac hypertrophy in mice, and the mRNA targets
of o8G-miR-1 function in affected phenotypes; the specific inhibition of 7o8G-miR-1 in
mouse cardiomyocytes was found to attenuate cardiac hypertrophy. o8G-miR-1 is
also implicated in patients with cardiomyopathy. Our findings show that the
position-specific oxidation of miRNAs could serve as an epitranscriptional
mechanism to coordinate pathophysiological redox-mediated gene expression.


The oxidation of guanine within nucleic acids produces 8-oxoguanine
(o8G), which can pair with adenine and induce guanine-to-thymine
(G>T) mutations in DNA. Despite receiving less attention, RNA is
more vulnerable than DNA to this modification, and o8G is formed
in RNA during the pathogenesis of various diseases. miRNAs are
post-transcriptional regulators that recognize their mRNA targets
through base-pairing to their seed regions (positions 2-8); each
miRNA can suppress hundreds of mRNA molecules by reducing their
stability and/or translation. In an indication that the oxidation of
miRNAs might be related to the redox state of the cell, extensive
oxidation of miR-184 was identified in the rat cardiomyoblast cell
line H9c2 under oxidative stress, targeting the antiapoptotic proteins
BCL-XL and BCL-W and resulting in increased death of cardiomyocytes.

Cardiac hypertrophy, an enlargement and thickening of the heart
muscle that can lead to heart failure, can be induced by stress, injury
or extracellular stimuli. Oxidative stress is known to be involved,
and has also been implicated in adrenergic hypertrophy. Numerous
cardiac miRNAs have been found to function as negative (for
example, miR-1 and miR-133) or positive (for example, miR-155
and miR-22) regulators of cardiac hypertrophy. miR-1 is critical for
cardiac function, cardiogenesis and the prevention of cardiac
hypertrophy. However, during ischaemia reperfusion, the ectopic
expression of miR-1 exacerbates arrhythmogenesis and injury.
Although it is likely that o8G•A base pairing could occur under such
conditions, so far o8G has been regarded only as a consequence of
oxidative damage, and its potent epitranscriptional roles have not
been investigated.


Oxidation of miRNA in cardiac hypertrophy 


Initially, we investigated whether hypertrophic stimulation by an
adrenergic receptor agonist could generate reactive oxygen species
(ROS) and oxidize miRNAs in H9c2 cells. Using flow cytometry, we
detected a rapid increase in ROS+ cells after treatment with
phenylephrine (Fig. 1a). Of the 33.5% of cells that were enlarged, 93%
produced ROS, representing a 1.8-fold increase in the total
proportion of ROS-producing cells after treatment (31.2%) compared
with before treatment (17.3%) (Fig. 1b). The abundance of ROS in
phenylephrine-induced hypertrophy was sustained for up to 48 h, as
confirmed in an independent batch of H9c2 cells (Extended Data
Fig. 1a, b). In addition, serum starvation, a precondition for stimulating
adrenergic hypertrophy, enhanced the enlargement phenotype
with a high basal ROS level (Extended Data Fig. 1c).
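
The percentages quoted for Fig. 1b can be checked with a few lines of arithmetic. The sketch below uses only the four numbers given in the text; the variable names are illustrative, and reading the 31.2% figure as the enlarged, ROS-producing subpopulation is an assumption rather than something the text states.

```python
# Percentages quoted in the text for Fig. 1b (as fractions).
enlarged_fraction = 0.335     # cells enlarged after phenylephrine
ros_within_enlarged = 0.93    # share of enlarged cells that produced ROS
ros_before = 0.173            # ROS-producing cells before treatment
ros_after = 0.312             # ROS-producing cells after treatment

# Enlarged AND ROS-producing cells, as a fraction of all cells.
enlarged_and_ros = enlarged_fraction * ros_within_enlarged

# Fold increase in the proportion of ROS-producing cells.
fold_increase = ros_after / ros_before

print(f"enlarged and ROS-producing: {enlarged_and_ros:.1%}")
print(f"fold increase in ROS-producing cells: {fold_increase:.1f}")
```

The product of 33.5% and 93% rounds to the quoted 31.2%, and 31.2% divided by 17.3% rounds to the quoted 1.8-fold increase.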

Treatment with the antioxidant N-acetylcysteine (NAC) diminished
the phenylephrine-induced hypertrophy of MF20+ cardiomyocytes
(Fig. 1c, d); such redox dependency was also confirmed in H9c2 cells
(Fig. 1e, Extended Data Fig. 1d-f) and in the AC16 human ventricular
cardiomyocyte cell line (Extended Data Fig. 1g, h) using flow cytometry.
Further expanding to an in vivo mouse model, we used chronic
administration of isoproterenol (ISO) (Extended Data Fig. 1i) to induce cardiac
hypertrophy (hearts were enlarged by approximately 15%) (Fig. 1f, g),
which was confirmed by echocardiogram (Extended Data Fig. 1j, k) and
was found to generate ROS (Extended Data Fig. 1l). Consistent with the
cell line results, co-treatment with NAC alleviated ISO-induced
hypertrophy (Fig. 1f, g, Extended Data Fig. 1k). Fluorometric analysis using
the probe dihydroethidium (DHE) showed that the largest increase in
Fig. 1 | Redox-dependent cardiac hypertrophy induces miRNA oxidation.
a, b, Measurement of ROS in H9c2 cells by flow cytometry after treatment with
phenylephrine (PE) (n=10,000 cells). Blue, ROS−; orange and red, ROS+. After
10 min treatment, the increase in hypertrophic ROS+ cells was substantial.
c-e, Cardiomyocytes (MF20+) in phenylephrine- and/or NAC-treated H9c2
cells. c, Inverted fluorescence microscopy images; scale bar, 50 μm. d, Average
cell size (units of 5,148 pixels; n>4 measurements, 100 cells, ImageJ); *P=0.038
(PE), 1.0 x 10+ (PE + NAC) and 0.001 (NAC) compared with mock. e, Distribution
of cell sizes; *P=0.031, 0.041 (phenylephrine, left to right), 0.036 and 0.049
(phenylephrine + NAC, left to right), respectively (mock with each condition).
f, g, Mouse hearts after treatment with ISO and/or NAC (n=9). f, Photographs
of mouse hearts after the indicated treatments; the scale on the ruler is 1 mm
and the squares are 25 mm². HW/TL, heart weight per tibia length (mg mm⁻¹).
*P=0.032 (ISO), 7.4 x10 (ISO + NAC) and 1.3 x 10> (NAC) compared with mock.
g, Heart weight as a proportion of body weight (HW/BW; mg/g) after the
indicated treatments. *P=4.2 x10*. h, Quantification of o8G in ISO-treated


ROS was associated with hypertrophy in phenylephrine-treated H9c2 
cells (Extended Data Fig. 2a). 

Next, separating RNAs by size (200 nucleotides (nt)), we measured o8G
by enzyme-linked immunosorbent assay (ELISA) (Fig. 1h) in mouse hearts
after treatment with ISO. We found that o8G accumulated in small RNAs
(3.2-fold increase after treatment) to a greater extent than in large RNAs
(1.8-fold increase); both values were higher than those obtained under oxidative stress
induced by paraquat (Fig. 1h). RNA oxidation was also supported by the
detection of redox-dependent cytoplasmic staining of o8G in ISO-treated
and in phenylephrine-treated H9c2 cells (Extended Data Fig. 2b) as well as
in phenylephrine-treated primary rat cardiomyocytes (rCMCs) (Fig. 1i).
We then further refined the analysis by measuring the amount of o8G in
miRNAs, based on the colocalization of o8G with Argonaute 2 (Ago2)
(Fig. 1j, Extended Data Fig. 2c). We found increased amounts of o8G in
miRNAs of phenylephrine-treated rCMCs (increase of approximately
3-fold after treatment) (Fig. 1k) and in miRNAs of phenylephrine-treated
or ISO-treated H9c2 cells (Extended Data Fig. 2d, e).

Consistent with the above findings, dot blotting with an o8G-specific
antibody revealed increasing oxidation of small RNAs
from phenylephrine-treated H9c2 cells (Fig. 1l, Extended Data
mouse hearts by ELISA; large RNA, >200 nt (5 μg); small RNA, <200 nt (0.4 μg).
Large RNA, *P=0.014; small RNA, *P=7.0 x 10°. i, Immunofluorescence staining
of o8G in rCMCs treated with phenylephrine and with phenylephrine + NAC.
Scale bar, 100 μm. j, Immunofluorescence staining of o8G and Ago2 in
phenylephrine-treated rCMCs. Scale bar, 100 μm. Arrows indicate colocalized
puncta (o8G+, Ago2+). k, Ratio of colocalization (o8G+, Ago2+) for each condition
compared with mock (n=6 measurements, 100 cells); *P=0.023. l, m, Dot blot
(l) and northwestern analysis (m) of o8G in small RNAs from H9c2 cells treated
with phenylephrine and/or NAC. Ratio, o8G intensity divided by RNA input
(n=4). In l, *P=0.01 (H9c2 PE), 0.0007 (mock NAC) and 0.011 (NAC PE). In
m, *P=0.019. n, Dot blot analysis of o8G in gel-extracted approximately 20-nt
miRNA, qualified in Extended Data Fig. 2l, from untreated or ISO-treated
mouse hearts. Ratio, o8G intensity divided by RNA input. *P=0.03. Data are
mean ± s.d. All P values are obtained from two-sided t-tests; repeated with
biologically independent samples. All graphs show data as mean ± s.e.m. unless
otherwise indicated.


Fig. 2f) and phenylephrine-treated rCMCs (Extended Data
Fig. 2g) in a redox-dependent manner. Northwestern blotting of
phenylephrine-treated H9c2 cells (Fig. 1m, Extended Data Fig. 2h, i)
and ISO-injected mouse hearts (Extended Data Fig. 2j, k) showed that
o8G occurred in miRNAs of approximately 20 nt in size, a finding that
was also supported by dot blotting of gel-extracted miRNAs of this size
(Fig. 1n, Extended Data Fig. 2l, m).


Sequencing of o8G in cardiac miRNAs

We next attempted to develop a sequencing method for o8G in miRNAs,
which we termed o8G-miSeq, by optimizing the immunoprecipitation
technique and the o8G-induced G>T transversion (Fig. 2a). Using
cross-linking immunoprecipitation (CLIP) conditions, the
immunoprecipitation process was improved substantially (Extended Data
Fig. 3a), yielding, on average, a 3,500-fold enrichment of o8G relative
to G (Fig. 2b). The occurrence of the G>T mutation in cDNA was
50-60%, which was validated by cleavage of an acquired restriction
enzyme site (Fig. 2c) and by sequencing (Fig. 2d, Extended Data
Fig. 3b-d).
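
The restriction-site readout can be illustrated with a toy model. This is a minimal sketch, not the authors' pipeline: it assumes the GAA(o8G)UC test sequence described for Fig. 2c, marks o8G with the placeholder character "o", and reads the position as T when o8G pairs with A during reverse transcription, or as G when it pairs with C. The function name and the marker character are hypothetical.

```python
ECORI_SITE = "GAATTC"  # recognition sequence of the EcoRI restriction enzyme


def cdna_readout(rna_template: str, pairs_with_a: bool) -> str:
    """Return the sequence read from cDNA, written in the sense orientation.

    'o' marks an 8-oxoguanine (o8G) in the RNA template. If o8G pairs
    with A during reverse transcription, the position is read as T;
    if it pairs with C, it is read as G, like an unmodified guanine.
    """
    dna = rna_template.replace("U", "T")
    return dna.replace("o", "T" if pairs_with_a else "G")


template = "GAAoUC"  # GAA(o8G)UC, with 'o' standing in for o8G

for pairs_with_a in (True, False):
    read = cdna_readout(template, pairs_with_a)
    cleavable = ECORI_SITE in read
    print(f"o8G pairs with {'A' if pairs_with_a else 'C'}: {read}, EcoRI site: {cleavable}")
```

Only the o8G•A outcome creates the GAATTC site, so the fraction of EcoRI-cleavable product serves as a readout of how often the G>T change occurs.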
Fig. 2 | o8G-miSeq for cardiac miRNAs. a, Schematic of the o8G-miSeq process.
b, Analysis of o8G by immunoprecipitation, showing the amount of
oxidized spike-in RNA (o8G) relative to the non-oxidized control (G); n>3.
Immunoprecipitation experiments were carried out under previously
reported conditions. Inset, *P=0.037; optimized condition (right), P=0.015.
c, d, G>T mutation induced by o8G•A base pairing during RT-PCR (c, left);
checked by susceptibility (GAAo8GTC > GAATTC) to EcoRI cleavage (c, right)
and sequencing (d). e, Results of o8G-miSeq analysis of H9c2 cells. o8G
enrichment is calculated as log2(IP/input), that is, o8G normalized by input read count.
The dashed red line indicates P=0.05. f, g, Results of o8G-miSeq analysis of
phenylephrine-treated rCMCs. f, Relative o8G enrichment is calculated as the
o8G enrichment difference between phenylephrine-treated cells versus


The o8G-miSeq method was then applied to H9c2 cells (Supplementary
Table 1a), revealing miR-1b as the most extensively oxidized
miRNA relative to its expression (o8G enrichment, log2(IP/input) = 7)
(Fig. 2e). No bias from either the amount of miRNA or the G-content of
the miRNAs was detected (Extended Data Fig. 3e). o8G-miSeq analysis
of rCMCs (Supplementary Table 1b) revealed that miR-1b was preferentially
oxidized over other miRNAs in response to phenylephrine
treatment (relative o8G enrichment) (Fig. 2f, Extended Data Fig. 3f).
Oxidation to o8G in miR-1b predominantly occurred in the seed regions
(Extended Data Fig. 3g), with an increase at positions 2, 3, 7 and 15 after
phenylephrine treatment (Fig. 2g).
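
The enrichment score used here is a log2 ratio of immunoprecipitated to input reads, and the relative enrichment is the difference between two such scores (treated minus mock), as defined in the Fig. 2 legend. The sketch below shows that calculation under stated assumptions: the read counts are invented for illustration, counts are taken to be already normalized for library size, and the pseudocount of 1 is a common convention that the text does not mention.

```python
import math


def o8g_enrichment(ip_reads: float, input_reads: float, pseudocount: float = 1.0) -> float:
    """log2(IP/input) for one miRNA; the pseudocount avoids division by zero."""
    return math.log2((ip_reads + pseudocount) / (input_reads + pseudocount))


def relative_enrichment(treated: float, mock: float) -> float:
    """Difference in o8G enrichment between treated and mock-treated cells."""
    return treated - mock


# Hypothetical read counts for a single miRNA (not taken from the paper).
treated_score = o8g_enrichment(ip_reads=1279, input_reads=9)  # log2(1280/10)
mock_score = o8g_enrichment(ip_reads=159, input_reads=9)      # log2(160/10)

print(treated_score)                                   # 7.0
print(relative_enrichment(treated_score, mock_score))  # 3.0
```

A score of 7 therefore corresponds to a 128-fold excess of the miRNA in the o8G immunoprecipitate relative to its input abundance.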

Notably, when phenylephrine-treated rCMCs were serum-starved,
position 7 of miR-1b was oxidized exclusively (Fig. 2h): miRNA sequencing
analysis (Supplementary Table 1c) indicated an approximately
2-fold increase of G>T transversion only at this position, and o8G-miSeq
revealed that the oxidation of miR-1b was most extensive upon treatment
with phenylephrine (Extended Data Fig. 3h). Considerable
oxidation of other miRNAs, including miR-184, let-7f-5p and miR-1-3p,
was detected in phenylephrine-treated H9c2 cells (o8G enrichment
>3, determined relative to untreated cells), with a substantial amount of
o8G overall (log2(o8G IP) > 10) (Fig. 2i, Supplementary Table 1d). Notably,
the oxidation of miR-184 after treatment with H2O2 has previously been
observed (Extended Data Fig. 3i).


Oxidized miR-1 silences targets via o8G•A


We next validated the oxidation of the miRNAs miR-1b, miR-1-3p, miR-184
and let-7f by confirming their significant increase during the o8G


mock-treated cells; log2(o8G-IP) is a measure of the amount of o8G. g, Positional
sequence variation of miR-1b, represented as a normalized mismatch (%);
Fisher’s exact test; *P=2.3 x10”, 6.9 x 10°, 5.6 x10“ and 4.0 x10” (left to
right). h, Same analyses as in g except for miRNA-seq in hypertrophic rCMCs
under serum starvation. *P=1.9 x 10™. Scale bars, 100 μm. i, Results of
o8G-miSeq analysis of phenylephrine-treated H9c2 cells. The red dots with
arrows indicate the selected oxidized miRNAs; faint red dots indicate miRNAs
for which o8G enrichment >3. The dashed lines indicate o8G in phenylephrine
treatment > 10 (x axis), and statistical significance > 1 (P<0.1) (y axis). All
P values are obtained from two-sided t-tests; data are mean ± s.e.m. unless
otherwise indicated.


immunoprecipitation experiment after treatment with an adrenergic
agonist (Fig. 3a, b). Notably, oxidized miR-1 was enriched in the hearts
of ISO-treated mice, despite decreased total levels upon ISO treatment
(Fig. 3b, Extended Data Fig. 3j). Focusing on miR-1, we further examined
whether the identified o8G positions in seed regions (positions 2, 3 and
7; denoted 2o8G-miR-1, 3o8G-miR-1 and 7o8G-miR-1) enable miR-1 to
recognize corresponding new target sites (2oxo, 3oxo and 7oxo sites)
via o8G•A base pairing (Fig. 3c). Luciferase reporter assays showed that
the o8G-miR-1s could silence targets through cognate oxo sites (Fig. 3d).
The oxo sites were responsive to phenylephrine but derepressed by NAC
treatment in rCMC cells (Fig. 3e), as well as in AC16 cells after treatment
with phenylephrine or H2O2 (Extended Data Fig. 3k, l); this indicates
that the level of endogenous o8G-miR-1 was sufficient to redirect target
recognition and achieve silencing.
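
The redirection of target recognition can be made concrete by deriving the site sequences. The following is a minimal sketch under stated assumptions: the miR-1-3p sequence is written from memory of the public annotation and should be checked against miRBase, the function name is hypothetical, and an o8G is modelled as pairing with A, which is equivalent to replacing that G with U (the same logic as the 2U, 3U and 7U variants described in this paper).

```python
# Assumed mature miR-1-3p sequence (5'->3'); verify against miRBase before use.
MIR1 = "UGGAAUGUAAAGAAGUAUGUAU"
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def target_site(mirna, start=2, end=8, oxo_position=None):
    """Return the mRNA site (5'->3') complementary to miRNA positions start..end.

    If oxo_position is given, the G at that 1-based position is treated as
    o8G, which pairs with A; this is modelled by substituting U for G.
    """
    seed = list(mirna[start - 1:end])
    if oxo_position is not None:
        idx = oxo_position - start
        if not 0 <= idx < len(seed) or seed[idx] != "G":
            raise ValueError("oxo_position must point at a G inside the seed")
        seed[idx] = "U"
    # Reverse complement of the (possibly modified) seed.
    return "".join(COMPLEMENT[base] for base in reversed(seed))


print("seed site:", target_site(MIR1))
for position in (2, 3, 7):
    print(f"{position}oxo site:", target_site(MIR1, oxo_position=position))
```

Under these assumptions the canonical seed site is ACAUUCC, whereas the 7oxo site is AAAUUCC; each oxo site differs from the seed site at a single nucleotide, so an oxidized miR-1 can engage a set of transcripts distinct from that of the unoxidized miRNA.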

To exclude the compounding effect of cellular heterogeneity, we
performed flow cytometry analyses with dual fluorescent protein (dFP)
reporters: we expressed a green fluorescent protein (GFP) gene that
includes miRNA target sites, and a red fluorescent protein (RFP) gene
that does not have the target site (Fig. 3f). We were therefore able to
detect o8G-miR-1-dependent repression at the level of individual H9c2
cells (Extended Data Fig. 4a), despite a low global level of endogenous
miR-1 (Extended Data Fig. 4b). As a result, every miR-1 oxo reporter
was endogenously suppressed by the basal level of o8G-miR-1, which
was confirmed by either transfecting miR-1 inhibitor or cognate miR-1
variants (Extended Data Fig. 4c). Significant phenylephrine-dependent
repression of the 7oxo reporter was observed (Extended Data Fig. 4d, e),
as measured by the cumulative distribution function (CDF) of relative
activities (P= 1.56 x 10°; Kolmogorov-Smirnov test) (Fig. 3f).
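
The CDF comparison rests on the two-sample Kolmogorov-Smirnov statistic, the largest vertical gap between two empirical cumulative distribution functions. The sketch below computes that statistic in plain Python for illustration only; the sample values are hypothetical, the function names are not from the paper, and no P value is calculated (a statistics library would normally be used for that).

```python
from bisect import bisect_right


def ecdf(sample):
    """Return the empirical CDF of a sample as a function of x."""
    ordered = sorted(sample)
    n = len(ordered)
    return lambda x: bisect_right(ordered, x) / n


def ks_statistic(sample_a, sample_b):
    """Two-sample Kolmogorov-Smirnov statistic: max |F_a(x) - F_b(x)|."""
    cdf_a, cdf_b = ecdf(sample_a), ecdf(sample_b)
    # The maximum gap is always attained at one of the observed values.
    return max(abs(cdf_a(x) - cdf_b(x)) for x in set(sample_a) | set(sample_b))


# Hypothetical relative reporter activities (arbitrary units).
mock_treated = [3, 4, 5, 6]
phenylephrine_treated = [1, 2, 3, 4]  # shifted towards lower activity

print(ks_statistic(phenylephrine_treated, mock_treated))  # 0.5
```

A leftward shift of the treated curve relative to the control curve, as in this toy example, is what repression of the reporter looks like in a CDF plot.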
Fig. 3 | o8G-miR-1 redirects target repression via o8G•A base pairing.
a, b, Validation of miRNA oxidation in phenylephrine-treated rCMCs (a) and
ISO-treated mouse hearts (b) using o8G immunoprecipitation followed by
qPCR. In a, relative o8G is the amount of o8G normalized to the o8G-containing
spike-in control. *P=0.051 (miR-1b), 0.013 (miR-1-3p), 0.048 (miR-184), 0.1
(let-7f) comparing phenylephrine and mock treatments. Data are mean ± s.e.m.
In b, relative level is the amount of o8G normalized to U6. miR-1 levels were
analysed by qPCR only. *P=0.03 (miR-1), 0.003 (o8G-miR-1) comparing mock
and ISO treatments. Data are mean ± s.e.m. c, New target sites of oxidized miR-1
as a result of redirection by o8G•A base pairing. d, Luciferase reporter assays for
2oxo, 3oxo and 7oxo sites using synthesized o8G-miR-1 (2o8G, 3o8G or 7o8G);
activity is reported relative to non-targeting control (NT). *P=0.002
(2o8G-miR-1), 8.4 x 10-8 (3o8G-miR-1), 1.6 x 105 (7o8G-miR-1) relative to NT.
e, Luciferase reporter assays for the oxo sites responding to phenylephrine
treatment or phenylephrine + NAC treatment in rCMCs. The control site is a


We further examined these results by calculating the fold changes
in fluorescence relative to the dFP reporter with no site (Extended
Data Fig. 4f, g), to confirm the repression of the 7oxo site by the endogenous
7o8G-miR-1 (RFP:GFP-7oxo versus RFP:GFP, non-targeting (NT))
(Fig. 3g) and its derepression by NAC (RFP:GFP-7oxo, mock versus
NAC) (Fig. 3h, Extended Data Fig. 4h). Notably, relative repression
increased when only the lowest 25% of reporter fluorescence values
(GFP-7oxo) were considered (Fig. 3h, right), suggesting the existence
of a cell population with a high level of 7o8G-miR-1. In support
of this, we observed similar results by switching the fluorescent
proteins in the dFP reporter, resulting in more sensitive detection
(Extended Data Fig. 4i-l). In addition, oxidized miR-1 was able to suppress
seed sites, albeit less potently than unoxidized miR-1 (Extended
Data Fig. 4m), presumably owing to retained activity of the o8G•C
base pairing.


o8G-miR-1 induces cardiac hypertrophy

Although miR-1 expression has been reported to induce atrophy,
treatment with phenylephrine impaired the miR-1-induced atrophy
of rCMCs (Fig. 4a, b, Extended Data Fig. 5a). Motivated by the detection
of o8G-miR-1, we introduced synthetic 2o8G-miR-1, 3o8G-miR-1 or
7o8G-miR-1 to rCMCs and observed a greater induction of hypertrophy
than was found after treatment with phenylephrine (Fig. 4b-d,
Extended Data Fig. 5b). This effect was found to be dependent on o8G•A
base pairing, because substitution of o8G with U (forming 2U-miR-1,
3U-miR-1 and 7U-miR-1), which could suppress only cognate oxo sites
(Extended Data Fig. 5c, d), caused the same hypertrophic phenotype
(Fig. 4d, e, Extended Data Fig. 5e). Concordant results were obtained
using H9c2 cells (Extended Data Fig. 5f). Specifically, hypertrophy
seed site with a mismatch at position 6 (pivot), impairment of which
abrogates seed-mediated target repression. *P=1.7 x10‘, 0.7 x10*, 0.004,
1.3 x10°, 4.5 x10“ and 1.7 x 10°, respectively. f, The dFP reporter with 7oxo
sites in GFP (RFP:GFP-7oxo, top) and CDF results in phenylephrine-treated
H9c2 cells (bottom; n>3). P=1.56 x 10°, two-sided Kolmogorov-Smirnov test.
g, h, Relative fold change (GFP, 7oxo site versus no site; n>3) in individual
H9c2 cells responding to 7o8G-miR-1 (g) and NAC treatment (h, left), plotted
depending on the level of dFP reporter expression (estimated by log10(RFP)).
The right graph in h shows the reanalysed results considering only the lowest
25% of GFP signals (within a similar range to RFP). The flow cytometer was
equipped with a blue laser (f) or with two separate lasers (g, h; to maximize the
excitation of both GFP and RFP). n=3; repeated with biologically independent
samples. In h, *P<0.05 (see Source Data for individual values). Unless stated, all
P values were obtained from two-sided t-tests. Data are mean ± s.d. unless
otherwise indicated.


induced by 7o8G-miR-1 or 7U-miR-1 substantially increased the expression
of atrial natriuretic peptide (ANP), a cardiac hypertrophy marker
(Fig. 4f, g), and enlargement of cells was confirmed by time-lapse imaging
of rCMCs (Fig. 4h, Supplementary Videos 1, 2) and of H9c2 cells
(Extended Data Fig. 5g) with cytometric quantification (Extended Data
Fig. 5h).

Next, we administered 7o8G-miR-1 to mice (Fig. 4i) and confirmed
its delivery to the heart by quantitative PCR (qPCR) (Extended Data
Fig. 6a). Hearts increased significantly in size, by at least 10% (Fig. 4i,
Extended Data Fig. 6b), with cardiomyocytes in the interventricular
septum increasing in size by approximately 19% (Fig. 4j, k, Extended
Data Fig. 6c), and ANP was significantly upregulated (Fig. 4l). Injection
of synthetic 7U-miR-1 (Extended Data Fig. 6d) also induced
cardiac hypertrophy in mice (Extended Data Fig. 6d, e), as did the
delivery of 7U-miR-1 as a cardiomyocyte-specific expression vector
(α-MHC promoter) in the presence of basal ROS levels (Extended
Data Fig. 6f-h).

We then generated cardiomyocyte-specific transgenic mice, in
which the expression of 7U-miR-1 led to exacerbated hypertrophic
phenotypes (Fig. 4m, Extended Data Fig. 6i); this was in contrast to
transgenic mice expressing miR-1, in which no change was observed
(Fig. 4n, Extended Data Fig. 6j-l). Transgenic mice expressing 7U-miR-1
had decreased survival rates (Extended Data Fig. 6m) and displayed
cardiac hypertrophy (Fig. 4o); the extent of the cardiac hypertrophy
phenotype was correlated with the level of 7U-miR-1 expression in
three independent transgenic lines throughout generations (Extended
Data Fig. 6n-u). The hearts of transgenic mice expressing 7U-miR-1
had enlarged cardiomyocytes in the interventricular septa (Fig. 4o-q,
Extended Data Fig. 6q, s) and showed progression towards myocardial
fibrosis (Fig. 4r, Extended Data Fig. 6r).
Fig. 4 | o8G-miR-1 generates cardiac hypertrophy. a-e, Morphology of rCMCs
after transfection of miR-1 (a, b; treatment with phenylephrine or with
phenylephrine + NAC), o8G-miR-1 (c, d; 2o8G, 3o8G or 7o8G) or U-miR-1 (d, e; 2U,
3U or 7U). Cell size was quantified using ImageJ (a unit of 5,148 pixels, n>59),
relative to mock treatment (transfection with non-targeting control miRNA
(b, d)). In box plots, the lines represent the median, first and third quartiles,
the whiskers denote the minima and maxima, and ‘x’ marks the mean value.
In b, *P=1.3 x 10% (mock vs PE, control) and 2.4 x 10° (mock, control vs miR-1).
In d, *P=4.3 x 10* (2o8G), 1.9 x 10°8 (3o8G), 1.7 x 10> (7o8G), 2.1x 10" (2U),
2.0 x10 (3U) and 0.012 (7U), compared with NT. f, g, ANP mRNA levels,
measured by qPCR (n=3) relative to mock. Data are mean ± s.d. In f, *P=0.007.
In g, *P=0.021 (2o8G), 0.012 (7o8G) and 0.034 (7U). h, Time-lapse bright-field
microscopy images of rCMCs transfected with 7o8G-miR-1 or 7U-miR-1
(Supplementary Videos 1, 2). n>3. i, Left, procedure for the intravenous (i.v.)
administration of 7o8G-miR-1 into mice. Middle, photographs of mouse hearts
after the administration of 7o8G-miR-1. The values denote HW/BW (mg g⁻¹);
*P=0.05. Right, HW/TL (mg mm⁻¹) after administration of 7o8G-miR-1 (with 25%
of non-targeting) or non-targeting control; *P=0.002. j, Tissue sections from
mouse hearts after the administration of 7o8G-miR-1. Top, haematoxylin and


o8G-miR-1 globally redirects target repression


We used RNA sequencing to assess the transcriptome-wide effects
of miR-1 oxidation in mouse hearts into which 7o8G-miR-1 had been
delivered (Fig. 5a, Supplementary Table 2a). According to CDF analysis,
putative target mRNAs containing the 6mer 7oxo site (positions 2-7) in
the 3’ untranslated regions (3’UTR) showed significant propensity for
downregulation (Fig. 5a, Extended Data Fig. 7a). The same results were
observed for 7mer 7oxo targets (positions 2-8) in each replicate or in
combination (Extended Data Fig. 7b, c). Among putative 7oxo targets
that were identified as differentially expressed genes (DEGs) (Fig. 5b),
downregulated genes were predominant (n=596 downregulated versus
n=22 upregulated genes). The downregulated DEGs were significantly
enriched in gene ontologies related to heart function, such as cardiac
morphogenesis, contraction and calcium ion transport (P<0.05)
(Fig. 5c). Gene set enrichment analysis (GSEA) and gene ontology analysis
also revealed the enrichment of genes from biological pathways
related to cardiac hypertrophy (Extended Data Fig. 7d-g), for example
antihypertrophic factors Gata4 and Gsk3b, and a pivotal calcium
regulator Atp2a2 (also known as Serca2). The 7o8G-miR-1-mediated
repression of Atp2a2, Gata4 and Gsk3b, the 3’UTRs of which contain
conserved 7oxo sites (Supplementary Table 3a), was further validated
in rCMCs by qPCR (Fig. 5d).
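
Assigning transcripts to the putative 7oxo target set amounts to scanning 3'UTR sequences for the 6mer or 7mer site. The sketch below illustrates that scan on a made-up UTR fragment; the motif strings assume the miR-1-3p seed sequence GGAAUGU (positions 2-8) with o8G at position 7 pairing with A, and the helper name and example sequence are hypothetical, not taken from the paper's pipeline.

```python
# Site sequences (5'->3' on the mRNA), derived under the assumptions above.
SEED_7MER = "ACAUUCC"   # pairs with positions 2-8 of unoxidized miR-1
OXO7_6MER = "AAUUCC"    # pairs with positions 2-7 of 7o8G-miR-1
OXO7_7MER = "AAAUUCC"   # pairs with positions 2-8 of 7o8G-miR-1


def find_sites(utr: str, motif: str) -> list:
    """Return 0-based start positions of every (possibly overlapping) match."""
    utr = utr.upper().replace("T", "U")  # accept DNA or RNA alphabets
    positions = []
    start = utr.find(motif)
    while start != -1:
        positions.append(start)
        start = utr.find(motif, start + 1)
    return positions


# Hypothetical 3'UTR fragment containing one 7oxo site and one seed site.
utr = "GCUAAAUUCCGAUACAUUCCUU"

print("7oxo 6mer:", find_sites(utr, OXO7_6MER))
print("7oxo 7mer:", find_sites(utr, OXO7_7MER))
print("seed 7mer:", find_sites(utr, SEED_7MER))
```

Transcripts with at least one match would form the putative target set whose expression changes are then compared with those of transcripts lacking a site.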

Similar results were obtained for 7o8G-miR-1 in H9c2 cells (Supplementary
Table 2b), as shown by CDF analysis (Extended Data Fig. 8a, b)
and by volcano plots (Fig. 5e). Notably, CDF analysis also revealed seed
sites in 7o8G-miR-1-mediated downregulation, although the extent of
the repression was marginal (Extended Data Fig. 8c, d). The 7oxo targets
were also enriched in gene ontologies of heart development and
morphogenesis function (P<0.05) (Extended Data Fig. 8e). Extending
the analysis to 2o8G-miR-1 and 3o8G-miR-1 (Supplementary Table 2c, d),
RNA sequencing and CDF analysis revealed that these miRNAs induced
significant repression of cognate targets (Extended Data Fig. 8f-h) and
the functions of these targets in gene ontologies of heart morphogenesis
(2oxo targets), blood circulation (2oxo and 3oxo targets) and heart
contraction (3oxo targets; Extended Data Fig. 8i, j). Some 7oxo targets
in rCMCs were further confirmed by qPCR (Fig. 5f): Gja1, Dram1 and
Pon2, both 7oxo and seed sites (Supplementary Table 3b); and Pcgf6
and Bicd2, 7oxo site only (Supplementary Table 4a, b). Additionally,
Fbxo32 (also known as atrogin 1), Pten and Usp18, reported to be
negative regulators of cardiac hypertrophy, were also validated as
targets of 7o8G-miR-1 (Extended Data Fig. 8k), because they contain
conserved 7oxo sites in 3’UTRs (Supplementary Table 4c).

Next, we used RNA-seq (Supplementary Table 5a) to examine the finding
that o8G-miR-1 targets are globally repressed in ISO-induced hypertrophic
hearts (positions 2, 3 and 7) (Extended Data Fig. 8l): this was
especially evident in 7oxo targets (Fig. 5g) and was somewhat evident
in 3oxo targets (Extended Data Fig. 8m). Genes that were downregulated
in ISO-treated hearts (log2 ratio < 0) were significantly enriched
upon 7o8G-miR-1-dependent repression (GSEA; Fig. 5h) but only marginally
enriched during 2o8G-miR-1-dependent repression (Extended
Data Fig. 8n). Finally, targets of 7o8G-miR-1 were experimentally validated
to be suppressed in either phenylephrine-treated rCMCs (Gja1,
Atp2a2 and Gsk3b, Fig. 5i; Bicd2, Extended Data Fig. 9a) or ISO-treated
hypertrophic mouse hearts (Gja1 and Gata4, Fig. 5i; Bicd2 and Atp2a2,
Extended Data Fig. 9b).


70°G-miR-1in cardiomyopathy and its loss of function 


We then broadened our investigation to the hearts of patients with car- 
diomyopathy. We retrieved and analysed data fromthe high-throughput 
sequencing of Ago-associated RNA”, isolated from the left ventricles of 
human hearts by crosslinking immunoprecipitation (Ago HITS-CLIP) 
(n= 6; Supplementary Table 5b-d). Although the rate of G>T muta- 
tions was low, the same pattern (positions 2, 3 and 7) was detected asin 
o°G-miR-1 (Fig. 6a). The G>T mutations were strongly clustered into two 
groups: one comprising a group of patients (1, 2, 4 and 5) with the high- 
est frequency of mutations at position 7, along with some mutations 
at other positions (positions 2, 3 and 12) (Fig. 6b); the other (patients 
3and 6) with the highest frequency of mutations exclusively at position 2 
(Fig. 6b). In Ago-mRNA clusters”’, miR-1 oxo sites were overrepresented 
more than expected (Fig. 6c), equivalent to an average of around 18% 
of seed sites, exceeding the G-U pairing (around 10%) and control sites 
(around 7%). Moreover, validated 708G-miR-1 targets—such as ATP2A2, 
GSK3B, BICD2, GJA1, GATA4, DRAM1, PTEN and FBXO32 (Extended Data 
Fig. 9c-j) and HSPB7 (Fig. 6d)—were found to map the 7oxo sites in 
Ago-mRNA peaks (CLIPick”’), which were confirmed to be functional 
by a luciferase reporter assay (Fig. 6e, Extended Data Fig. 9k). 
Independently, by conducting covalent ligation of endogenous
Argonaute-bound RNAs with cross-linking and immunoprecipitation
(CLEAR-CLIP)* experiments that produce ligated chimaera reads
between miR-1 and target sites (Extended Data Fig. 9l, Supplementary
Table 6a), we found that HSPB7, which has been identified as a risk
gene for idiopathic dilated cardiomyopathy*°, also had a 7oxo site
(Fig. 6d). We observed that the frequency of Ago binding in 7oxo sites
was increased in ISO-treated mouse hearts, in contrast to a decrease
in its binding in seed sites (Extended Data Fig. 9m, n). The 7oxo site in
HSPB7 (Fig. 6d, Extended Data Fig. 9o) was further validated by qPCR
(Extended Data Fig. 9p). We also checked the activity of the 7oxo site
in conjunction with the seed site in GJA1”, and observed combinatorial
cooperativity for target repression in luciferase reporters (Extended
Data Fig. 9q).
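
As an illustration of how positional G>T mismatch rates of the kind reported in Fig. 6a, b can be tabulated, the following is a minimal sketch and not the authors' pipeline. It assumes that reads are already trimmed, written in the DNA alphabet and aligned without gaps to the 5' end of the miRNA. The function name `g_to_t_rates`, the toy reads and the reference string (the commonly listed mature miR-1 sequence, which should be checked against miRBase) are assumptions introduced here.

```python
def g_to_t_rates(reference, reads):
    """Percentage of reads showing T at each reference G (1-based positions).

    Assumes ungapped reads aligned to the 5' end of the reference; reads that
    are too short to cover a position are ignored at that position.
    """
    rates = {}
    for index, base in enumerate(reference):
        if base != "G":
            continue
        covering = [read for read in reads if len(read) > index]
        if not covering:
            continue
        mismatched = sum(1 for read in covering if read[index] == "T")
        rates[index + 1] = 100.0 * mismatched / len(covering)
    return rates


# Assumed reference (cDNA alphabet); verify against miRBase before real use.
MIR1_REFERENCE = "TGGAATGTAAAGAAGTATGTAT"

# Toy reads, invented for illustration: one G>T at position 7, one at position 2.
toy_reads = [
    "TGGAATGTAAAGAAGTATGTAT",
    "TGGAATTTAAAGAAGTATGTAT",
    "TTGAATGTAAAGAAGTATGTAT",
    "TGGAATGTAAAGAAGTATGTAT",
]

if __name__ == "__main__":
    for position, rate in sorted(g_to_t_rates(MIR1_REFERENCE, toy_reads).items()):
        print(f"position {position}: {rate:.1f}% G>T")
```

Real HITS-CLIP data would additionally need read alignment and a background estimate of sequencing error before such rates could be interpreted.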

Finally, adapting the concept of miRNA sponges”, the physiological
relevance of 7o8G-miR-1 was evaluated using a loss-of-function study, in
which competitive inhibitors against miR-1 (anti-seed) and 7o8G-miR-1
(anti-7oxo) were prepared to contain a tandem repeat of cognate target
sites (Extended Data Fig. 10a). After validating their specificity
using a luciferase reporter assay (Extended Data Fig. 10b, c) and RNA
sequencing (Supplementary Table 6b, Extended Data Fig. 10d), we
introduced anti-7oxo (4 sites; 4x) to rCMCs and observed the attenuation
of phenylephrine-induced hypertrophy (Extended Data Fig. 10e).
Next, several transgenic mouse lines expressing anti-7oxo in
cardiomyocytes (α-MHC promoter, 13x) (Fig. 6f) were generated (Extended
Data Fig. 10f, g) and confirmed to show specific expression of anti-7oxo
(Extended Data Fig. 10h, Supplementary Table 6c) and global derepression
of 7o8G-miR-1 targets (Fig. 6g, Extended Data Fig. 10i). There was
no difference in basal heart size, but ISO-induced hypertrophy was
significantly prevented in all transgenic lines of both F, and F,
generations (Fig. 6h, Extended Data Fig. 11a-e) without enlargement of
cardiomyocytes (Extended Data Fig. 11f-h). Overall, we conclude that
position-specific oxidation of miR-1, in particular to form 7o8G-miR-1,
serves as an endogenous driver of cardiac hypertrophy and diseases.


Discussion 

Here we demonstrate that the activation of adrenergic receptors generates
ROS and leads to the formation of o8G in the seed region of miR-1.
miR-1 is thus redirected to repress new targets that function in cardiac
Fig. 6 | 7o8G-miR-1 is implicated in cardiomyopathy and its loss of function.
a, b, Sequence variation in Ago-associated miR-1, either mismatch rate (%) in
seed as average (a) or the individual rate (%) (b) for each position. The
Ago-associated RNA data, obtained by HITS-CLIP from the left ventricle of
human hearts, are from ref.”*. c, Relative frequency (%) of miR-1 oxo sites
(G•A at position 2G, 3G or 7G) relative to seed sites in the normalized Ago-mRNA
clusters?®. G-U, wobble pairing sites; control, control sites (G•G, no pairing);
Exp, expected frequency; miR-124, seed site of miR-124 (not expressed in heart).
d, miR-1 7oxo site in the 3’UTR of HSPB7 (top), overlapping with the Ago-mRNA
peaks (P < 0.05, CLIPick”’; red). Other peaks are shown in grey. The sequences
of the site are shown at the bottom; 7oxo sites in other genes are shown in
Extended Data Fig. 9c-j. e, Validation of the 7oxo sites by luciferase reporter
assay. Half-maximal inhibitory concentrations (IC50) of GATA4, ATP2A2 and BICD2
are stated in Extended Data Fig. 9k; sequences of the sites are provided in


hypertrophy pathways (Extended Data Fig. 11i, Supplementary Discussion).
Levels of o8G-miR-1 increase immediately upon treatment with an
adrenergic agonist and are sustained for up to a week in hypertrophic
hearts (Extended Data Fig. 11j). Furthermore, sequences of the miR-1 oxo
sites are evolutionarily conserved in 3’UTRs (Extended Data Fig. 11k). Our
findings suggest that the formation of o8G in miRNA could be the general
epitranscriptional mechanism by which ROS, under pathophysiological
conditions, can fine-tune gene expression as
part of redox signalling that ultimately leads to phenotypic alterations.
Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 
Extended Data Fig. 1 | Adrenergic cardiac hypertrophy depends on ROS.
a, b, ROS measurement in H9c2 cells after phenylephrine treatment in a
time-dependent manner, detected by fluorescent ROS dye (DHE) and flow
cytometry (10,000 cells); M1 (blue), cells with moderate ROS; M2 (red), cells
with high level of ROS (a). The results were further analysed for cell size;
log10(forward scatter), y axis; hypertrophy, cell size >3 (b). c, Same analysis
as in a, b except with serum starvation and showing the distribution of cell size
in left panels; number of cells or counts (%), y axis (left); n = 3 biologically
independent experiments. d, Morphology of H9c2 cells in the presence of
phenylephrine and/or NAC treatment, quantified for their average cell size
relative to NAC treatment (n > 40 cells, ImageJ; top left graph); NT versus
phenylephrine, P= 9.5 x 107; error bars, s.e.m. Immunofluorescence staining of
cardiomyocytes, discriminated by staining of sarcomeric isoforms of myosin
heavy chain (MF20, a marker of cardiomyocyte differentiation). Notably,
differentiated cardiomyocyte lineage cells (MF20+) were quantified owing to the
different extent of fate heterogeneity in H9c2 cardiomyoblast cells; the same
batch of H9c2 cells was used in a-c; scale bar, 50 μm. e-h, Same analyses as in c
except examining the effect of NAC treatment in H9c2 cells (e, f) and using AC16
cells (g, h); n = 3 biologically independent experiments; H9c2 cells in all of
these figures (a-h) were obtained from the Korean Cell Line Bank. Of note,
prevention of ISO-induced cardiac hypertrophy by NAC treatment indicates that ROS
production is probably upstream of the hypertrophic phenotype. i, j, Chronic
intraperitoneal (IP) injection of 75 mg kg⁻¹ ISO and PBS (used as control) was
performed in mice (n = 7 biologically independent mice) every two days for
29 days (i) and their echocardiography results (n = 3 biologically independent
mice) were obtained (j); IVS, interventricular septum; LVID, left ventricular
internal diameter; LVPW, left ventricular posterior wall; EF, ejection fraction
(%); FS, fractional shortening (%); LV, left ventricular; d, diastolic; s, systolic;
*P = 0.024 (IVS;d, ISO versus Cont) and 0.027 (IVS;s, ISO versus Cont),
respectively; data are mean ± s.e.m. k, Same experiments as performed in
i, j, except with NAC treatment and echocardiography at day 21; *P = 0.001,
0.004, 0.031 and 0.033, respectively; n = 3 biologically independent mice; data
are mean ± s.e.m. l, Measurement of ROS in lysate (100 pg proteins) from the
ISO-induced hypertrophic mouse hearts by using fluorescent ROS dye,
CM-H2DCFDA. *P = 0.046. All P values from two-sided t-test; *P < 0.05; n = 3
biologically independent samples; data are mean ± s.d. unless otherwise
indicated.
Extended Data Fig. 2 | Adrenergic cardiac hypertrophy oxidizes miRNAs.
a, Extent of ROS and alteration of cell size were monitored by using ROS
fluorescence dye (DHE) in phenylephrine-treated H9c2 cells in a time-dependent
manner; scale bar, 50 μm. Notably, sustained elevation of ROS was also
observed (up to 48 h) by flow cytometry (Extended Data Fig. 1a, b).
b, Immunofluorescence staining of oxidized nucleic acids in ISO- and/or
NAC-treated H9c2 cells by using o8G-specific antibody (top). Cytoplasmic pattern
of o8G staining disappeared upon treating with RNase, confirming that o8G
occurs in RNA (middle). Phenylephrine-treated H9c2 cells were also examined
(bottom); scale bar, 50 μm. c-e, Immunofluorescence staining of o8G and
Ago2 in ISO- and/or NAC-treated rCMCs (c), phenylephrine- and/or NAC-treated
H9c2 cells (d), and ISO- and/or NAC-treated H9c2 cells where their
colocalization was quantified (e); coloured arrow, cytoplasmic puncta stained
as both Ago2 and o8G (d); n > 4; *P = 0.012 (e, right); scale bars, 100 μm (c) or
50 μm (d, e). Every independent experiment (n > 3) showed similar results (a-e).
f, Dot blot analyses of o8G for small RNA, extracted from phenylephrine- and/or
NAC-treated H9c2 cells (top left) with quantification (relative to small RNA
input in lower left panel; right); n = 3; *P = 0.009. g, Same analyses performed
in phenylephrine- and/or NAC-treated rCMCs as in f. h-k, Northwestern blotting
of o8G for total RNA (20 pg), extracted from NAC-treated H9c2 cells (h),
phenylephrine- and/or NAC-treated H9c2 cells (i) and ISO-injected mouse
hearts (j) with quantification of normalized intensity (intensity ratio, o8G
signal per amount of RNA; k) in 20 nt RNA (indicated with arrow or asterisk);
n = 5 biologically independent mice; Mock, a control set with injection of PBS;
HW/BW, heart weight per body weight (j); 20 nt-o8G, synthesized 20-nt-long
RNA with o8G used as a size marker; *P = 0.024 (k). l, miRNA was isolated from
total RNA in mouse hearts based on their ~20 nt size (left; *), where existence of
muscle-specific miR-1 was confirmed by qPCR with no detection of a
brain-specific miRNA, miR-124 (right); M, synthesized 20-nt-long RNA used as a
size marker; P=4.24 x 10 (miR-1 versus miR-124). m, Dot blot analysis of o8G was
performed for the gel-extracted miRNAs (40 ng), derived from ISO-treated
hypertrophic mouse hearts (left) and the results were quantified as in k (right);
error bars, s.d.; *P = 0.029; n = 3 biologically independent samples. All P values
from two-sided t-test; *P < 0.05; n = 3 biologically independent samples; data are
mean ± s.e.m. unless otherwise indicated.
Extended Data Fig. 3 | Development of o8G-miSeq to identify oxidized
miRNA and o8G position for cardiac hypertrophy. a, Optimization of o8G IP
by adjusting IP and wash condition, estimated by using synthesized RNA
containing o8G as spike-in control (100 pg or 10 pg) followed by qPCR; relative
enrichment, amount of o8G RNA per non-oxidized RNA. “Ab masking” denotes
preincubation of o8G antibody with non-oxidized G; *P=0.04, 1.6 x10~°, 1.3 x10*
and 0.02, respectively; n > 3; error bars, s.e.m. The detailed process and the
composition of buffers are described in Supplementary Methods. b-d,
Sequencing results of cDNA, reverse transcribed from synthesized o8G RNA
(Fastq files, http://clip.korea.ac.kr/oxog/; b). Sequencing reads with <2
mismatches (2MM) were analysed for the frequency of variation, found in the
introduced position of o8G; based on raw counts (c, top); based on unique
reads, to exclude artificial bias derived from PCR amplification (c, bottom;
discriminated by degenerate barcode, details in Supplementary Methods).
Numbers of variation in every position of synthesized RNA were also
represented; G (d, top) versus o8G (d, bottom), position 7; read-count, y axis.
Of note, ~50-60% G>T variation rate is in agreement with thermal denaturation
transitions obtained from RNA:DNA duplexes**”». e, o8G-miSeq results from
H9c2 cells. o8G enrichment, log ratio of o8G IP normalized to the input read
counts for miRNAs, was plotted depending on the miRNA abundance (input;
e, left), number of G in sequences as heat map density (e, right). f, o8G
enrichment (log2(IP/input)) was analysed depending on the level of miRNA
(log2 value of read-count from miRNA-seq) in the absence (left) versus
presence of phenylephrine treatment (right); miR-1b, red dot indicated by red
arrow. g, The positional sequence variation of miR-1b is represented as a
normalized mismatch (%) in o8G-miSeq results from H9c2 cells; error bars,
s.e.m. The o8G in miR-1b was found to be enriched predominantly in seed region
(positions 2, 3 and 7) based on G>T mutation rate. Notably, o8G at position 15
was also observed to be comparable to that of the other positions in seed.
h, o8G-miSeq in phenylephrine-treated rCMCs with serum starvation. Relative
o8G enrichment was plotted for miRNAs depending on the extent of
phenylephrine-induced oxidation (log2(o8G IP) in phenylephrine). Of note,
miR-1b showed the most marked phenylephrine-induced oxidation (relative
o8G enrichment = 0.8) with a considerable level of o8G (log2(o8G-IP)>10, dotted
red line). i, o8G-miSeq result of H9c2 cells was compared with previous results
of oxidized miRNAs in H2O2-treated H9c2 cells"°, which were identified by using
o8G IP and microarray. The previous results were represented inside of the
current o8G-miSeq data, highlighted as a yellow circle with indicated arrow in
the plot of o8G enrichment (amount in o8G IP relative to abundance) and
abundance of miRNA (log2(input)). Notably, except miR-184, the previously
identified oxidized miRNAs were not strongly enriched in the o8G-miSeq
results, presumably due to disregarding background from different abundance
of miRNA or lack of o8G IP optimization in the previous trial. The discrepancy
also could be caused by heterogeneity of H9c2 cell lines, which behave
differently depending on culture condition and batches. j, Relative o8G of miR-1
in ISO-treated mouse heart (chronic injection) was measured by o8G IP and
qPCR; the amount of miR-1 in o8G IP was normalized by the amount of
o8G-containing spike-in control; error bars, s.e.m.; *P = 0.002. miR-1 showed the
most dramatic enhancement (fivefold). Of note, elevation of o8G-miR-1 in
chronic injection of ISO suggests that o8G-miR-1 could persist during chronic
stress for cardiac hypertrophy. k, l, Luciferase reporter assays with oxo sites of
miR-1 as performed in Fig. 3e except for AC16 with phenylephrine (k) or H2O2
treatment (l); *P=0.001, 6.1x 10, 0.001, 0.025, 0.027, 0.034 and 0.012,
respectively (k); *P=2.3 x10*, 0.002, 0.001, 0.007, 0.018, 2.0 x 10° and 1.210,
respectively (l). All P values from two-sided t-test; *P < 0.05; n = 3 biologically
independent samples; data are mean ± s.d. unless otherwise indicated.
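
The o8G enrichment plotted in panels e, f, h and i of Extended Data Fig. 3 is described as a log ratio of o8G IP reads to input reads. A minimal sketch of that kind of calculation follows. The pseudocount, the function name `o8g_enrichment`, the placeholder names `miR-A` and `miR-B`, and the toy counts are assumptions made here; the authors' exact normalization is described in their Supplementary Methods and may differ.

```python
import math


def o8g_enrichment(ip_counts, input_counts, pseudocount=1.0):
    """Return log2((IP + pseudocount) / (input + pseudocount)) per miRNA.

    The pseudocount (an assumption of this sketch) avoids division by zero
    for miRNAs that are absent from one of the two libraries.
    """
    enrichment = {}
    for name in set(ip_counts) | set(input_counts):
        ip = ip_counts.get(name, 0) + pseudocount
        background = input_counts.get(name, 0) + pseudocount
        enrichment[name] = math.log2(ip / background)
    return enrichment


# Toy read counts, invented for illustration only.
toy_ip = {"miR-1b": 31, "miR-A": 7}
toy_input = {"miR-1b": 7, "miR-A": 7, "miR-B": 3}

if __name__ == "__main__":
    for name, value in sorted(o8g_enrichment(toy_ip, toy_input).items()):
        print(f"{name}: {value:+.2f}")
```

Plotting such values against log2(input) gives the abundance-aware view the caption describes, in which a highly expressed miRNA is not scored as oxidized merely because it contributes many IP reads.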
Extended Data Fig. 4 | Oxidized miR-1 silences new target sites via o8G•A
base pairing. a, dFP (dual fluorescent proteins) reporter with miR-1 seed sites
in GFP (top) was validated to detect miR-1-dependent repression at the
endogenous level of individual H9c2 cells; NT, non-targeting control; miR-1,
synthesized miR-1 used as positive control; miR-1 inhibitor for blocking
endogenous miR-1. b, Expression level of miR-1 in AC16 cells, H9c2 cells, rCMCs
and mouse heart, measured by qPCR relative to U6; error bars, s.e.m. Notably,
heterogeneity of AC16 and H9c2 cells could result in low global level of miR-1 in
total small RNAs, indicating the value of examining miR-1 at individual cell
level. c, Same dFP reporter analyses as performed in a except for 7oxo, 3oxo and
2oxo sites; 7U-miR-1, 3U-miR-1 or 2U-miR-1, used as positive control. Of note,
every miR-1 oxo site (7oxo, 3oxo and 2oxo site) in the dFP reporters was
endogenously suppressed with sensitivity to detect the repression mediated
by the basal level of o8G-miR-1, confirmed by either transfecting miR-1 inhibitor
or cognate miR-1 variants. d, Phenylephrine-dependent enlargement of H9c2
cells, measured by size (log10(FSC)) using flow cytometer. e, Distribution of GFP
value (log10(GFP)) in dFP reporter, of which GFP contains miR-1 7oxo sites (left).
Relative repression (log10(GFP/RFP)) was calculated by averaging the reporter
fluorescence values (GFP-7oxo) with a similar range of control fluorescence
values (RFP) in the cells (right). The relative repression was also examined in
phenylephrine-treated H9c2 cells. Of note, flow cytometry in a, c-e used a blue
laser for the excitation. f, g, Quantification of dFP reporter in H9c2 cells by flow
cytometry. Based on scatter plot of GFP versus RFP (f), GFP value (log10(GFP))
with 7oxo sites (RFP:GFP-site; red) was examined by averaging GFP values
derived from similar range of control RFP values in the cells (g), further
normalized as relative fold change (Fig. 3g), normalized by dFP reporter with
no site (RFP:GFP-no site; black); 7o8G-miR-1, used as positive control (purple).
Vectors with no fluorescence protein gene (control) were used to measure level
of autofluorescence. h, Same analyses as in f except for considering NAC
treatment. Of note, 10 times greater amount of reporter vector was used in h
than in f, resulting in fewer transfected cells in f. i-k, Flow cytometry analysis
using switched dFP protein reporter, of which fluorescent proteins were
interchanged in i, top, resulting in sensitive detection of the suppression of
miR-1 7oxo site by the endogenous level of 7o8G-miR-1 (i, j); experiments with
positive control, 7o8G-miR-1 (j). Repressive propensity of RFP with 7oxo site
(GFP:RFP-site; i, middle) was shown in merge (i, right), relative to RFP with no
site (GFP:RFP-no site; i, left), showing similar tendency of shifting; relative fold
change, normalized RFP values denominated by dFP reporter with no site
(RFP:GFP-no site; black). l, Same analysis as in i-k with switched dFP reporter
except using phenylephrine (left) and miR-1 inhibitor (right). Of note,
phenylephrine-induced repression of miR-1 7oxo sites became more
substantial when only a limited cell population, for which reporter values (RFP)
were in the lowest 25%, was considered (middle). Observation of the restored
reporter activity (RFP) by introducing miR-1 inhibitor (right) further confirmed
that phenylephrine-dependent repression of miR-1 7oxo sites was mediated by
miR-1. The flow cytometer used in f-l was equipped with two separate lasers to
maximize the excitation of both GFP and RFP, resulting in increased sensitivity
of the detection. m, Luciferase reporter assays with miR-1 seed sites in the
presence of 5 nM of control (cont; NT), 2o8G-miR-1, 3o8G-miR-1, 7o8G-miR-1 and
miR-1; *P= 0.005, 0.4 x 10° and 0.1* 107‘; error bars, s.e.m.; n = 6 biologically
independent samples. Notably, introduction of oxidized miR-1 (2o8G, 3o8G and
7o8G) could also suppress seed sites in luciferase reporter, albeit less potently
than unoxidized miR-1, presumably because of retained activity of o8G•C base
pairing. All P values from two-sided t-test; *P < 0.05; n = 3 biologically
independent samples; data are mean ± s.d. unless otherwise indicated.
Extended Data Fig. 5 | Oxidized miR-1 elicits hypertrophy of cardiomyocyte
through o8G•A base pairing. a, Effect of miR-1 expression on rCMCs in the
presence of phenylephrine and/or NAC treatment; scale bar, 100 μm.
b, Morphology of rCMCs after the transfection of oxidized miR-1 (2o8G, 3o8G or
7o8G); scale bar, 100 μm. Every independent experiment (n > 3) showed similar
results (a, b). c, Sequences of o8G-miR-1 mimics, of which o8G was substituted
with U (2U-miR-1, 3U-miR-1 and 7U-miR-1), and its cognate sites (2oxo, 3oxo and
7oxo site). d, Luciferase reporter assays with different concentration of
2U-miR-1, 3U-miR-1 or 7U-miR-1 for its cognate 2oxo, 3oxo or 7oxo site (red),
compared with miR-1 seed site (blue). Of note, U-miR-1 (2U, 3U or 7U) cannot
repress seed site. e, rCMC with transfection of miR-1, 2U-miR-1, 3U-miR-1 or
7U-miR-1; scale bar, 100 μm. f, H9c2 cells with transfection of miR-1, oxidized
miR-1 (2o8G, 3o8G or 7o8G) or U-miR-1 (2U, 3U or 7U). Phenylephrine treatment
was used as a positive control inducing hypertrophy; scale bar, 50 μm. g, Time
lapse images of H9c2 cells transfected with 7o8G-miR-1, 7U-miR-1 or miR-1; scale
bar, 50 μm. Every independent experiment (n > 3) showed similar results (e-g).
h, Distribution of H9c2 cell size after 7U-miR-1 transfection (flow cytometry,
10,000 cells, n = 3); NT, non-targeting miRNA control. P values from two-sided
t-test (d, h); *P < 0.05; n = 3 biologically independent samples; data are mean ±
s.d. unless otherwise indicated.

Extended Data Fig. 6 | 7o8G-miR-1 induces cardiac hypertrophy in vivo.
a, b, In vivo injection of 7o8G-miR-1 as polyethylenimine (PEI) complex.
Non-targeting control (NT) was injected together to check the delivery; “Cont”,
5xNT (100% NT); “7o8G”, 1xNT (1:4; 25% NT, 75% 7o8G-miR-1); 5 mg kg⁻¹ (left); IV,
intravenous tail injection. The delivery rates were estimated by qPCR (log2 ratio
relative to the negative control (-), normalized to U6, n = 4 biologically
independent mice); error bars, s.d.; *P=7.2 x 10 >and 0.017, respectively
(a, right). The mouse hearts were displayed in b. c, Immunofluorescence
staining of interventricular septum from 7o8G-miR-1 delivered mouse hearts;
wheat germ agglutinin (WGA), cell border, red; MF20, cardiomyocyte, green;
DAPI, nucleus, blue; scale bar, 100 μm. Every independent experiment (n > 3)
showed similar results. d, e, Same experiments as in a, b except using 7U-miR-1
as mixture of NT (1:4, 5 mg kg⁻¹) and PEI (N/P ratio = 8); *P = 0.002 and 0.029,
respectively (d, right); representative mouse hearts (e). f,
Cardiomyocyte-specific expression vector (pJG/ALPHA MHC), which contains α-MHC
(alpha myosin heavy chain) promoter to drive cardiomyocyte-specific expression of
7U-miR-1 (α-MHC::7U-miR-1). g, Injection of α-MHC::7U-miR-1 plasmid
(1.9 mg kg⁻¹) as complex with PEI (N/P ratio = 8) into PBS- or NAC-administrated
mice (100 mg kg⁻¹, IP injection; time interval of ~8 h at the same day). Delivery
of the plasmid to heart tissue was confirmed by qPCR of hGH poly (A)
in the plasmid. h, α-MHC::7U-miR-1 delivered mouse hearts (left) and the
quantification of their size (HW/TL; right); box plots with median line,
first and third quartile; whiskers, minima and maxima (right); n > 4 biologically
independent mice; *P = 0.022 (7U-miR-1 versus others). Of note, expression of
7U-miR-1 in cardiomyocyte promoted enlargement of hearts only in the
absence of NAC treatment, suggesting that basal level of ROS is required for
7o8G-miR-1-induced cardiac hypertrophy. i, Generation of cardiomyocyte-specific
transgenic mouse with expression of 7U-miR-1 (7U-miR-1 TG);
transgene cassette, generated from α-MHC::7U-miR-1 plasmid by treating
BamHI (top). Genotyping results of two independent founders (F,; #7, #43;
bottom left) and section of their hearts (bottom right) at the age of p73 (#7)
and p56 (#43). Of note, both 7U-miR-1 TG showed marked increase in heart size
(HW/TL = 13.23 and 10.08, relative to average size of HW/TL = ~5.0), especially
evident in the size of right ventricle (rv); lv, left ventricle. j-l, Generation of
cardiomyocyte-specific transgenic mice of miR-1 (α-MHC::miR-1 TG) following
the same process as used in i. Notably, there was no change in heart size of
α-MHC::miR-1 TG; Fy (HW/TL = 4.0; j); F,, TG(-) versus TG(+) (k, left); HW/TL,
n > 4 (k, right). WGA staining of heart tissue from α-MHC::miR-1 TG (TG(-)
versus TG(+)), showing no difference in cell size (l, left); 40 μm; quantification
of relative cell size (n = 70), ImageJ (l, right). m, Survival rate (%) of
α-MHC::7U-miR-1 TG (F,). n, Hearts of 7U-miR-1 TG (F,; TG(-) versus TG(+)),
derived from 3 independent founders (#33, #37, #44; top); HW/TL (bottom).
Expression of pri-miR-1 was examined by qPCR to infer the copy number of
transgene; relative expression (“pri-miR-1”, bottom); *P=4.8 x10, 0.0009, 0.014,
0.009, 0.0007 and 0.002, respectively; n = 3 biologically independent mice.
o, Ventricle region of 7U-miR-1 TG heart (F,, #33, TG(-) versus TG(+); H&E
staining); scale bar, 1 mm. p, Measurement of total miR-1 level, contributed by
expressing 7U-miR-1 transgene (7U-miR-1 TG, F,, #33); TG(+) vs TG(-); qPCR results
normalized by U6 abundance (miR-1/U6); n = 9. Of note, expression of
7U-miR-1 transgene added negligible amount to endogenous miR-1 in hearts,
supporting the physiological relevance of the 7U-miR-1 transgenic mouse.
q, r, Immunofluorescence staining of cardiomyocytes in the interventricular
septum of 7U-miR-1 TG (F,, #33, q) and their Masson’s trichrome staining (r);
WGA, red; MF20, green; DAPI, blue; scale bar, 100 μm. s, H&E staining of heart
section of 7U-miR-1 TG (F,, #37, TG(-) versus TG(+); top). Enlargement of cells
in 7U-miR-1 TG was observed (bottom); rectangular part of the top panel is
represented; H&E staining; scale bar, 100 μm. t, Hearts of 7U-miR-1 TG (F,; TG(-)
versus TG(+)), derived from 3 independent founders (#33, #37, #44). Section of
7U-miR-1 TG heart (F,, #33, TG(-) versus TG(+); H&E staining) was also displayed
(t, bottom left); scale bar, 1 mm. u, The heart size was quantified for 7U-miR-1 TG
(F,; TG(-) versus TG(+); #33 (n = 11 versus 6), #37 (n = 13 versus 12), #44 (n = 5
versus 7)); HW/TL; *P = 0.0001, 8.6 x 10° and 0.009, respectively. All P values
from two-sided t-test; *P < 0.05; n = 3 biologically independent samples; data are
mean ± s.e.m. unless otherwise indicated.

Extended Data Fig. 7 | Transcriptome-wide analysis of 7o8G-miR-1-delivered
mouse hearts. a, Kolmogorov-Smirnov test results for Fig. 5a. There is
significant downregulation (P=5.4 x 10 7°) of mRNAs with 7oxo site (6mer,
positions 2-7, in 3’UTR) and significant upregulation (P=1.6 x 10°) of mRNAs
with no site (neither miR-1 7oxo site nor seed site, 6mers) depending on
7o8G-miR-1 expression, relative to total (total mRNAs). However, mRNA with control
site (“cont site”, mismatched site at position 6 of miR-1 in 3’UTRs but no 7oxo
site in the transcript, previously reported to be unrecognized by miRNA*~) showed
no significance. b, CDF analysis for mRNAs with miR-1 7oxo site (7mer, position
2-8, in 3’UTR, n = 952, red line) depending on 7o8G-miR-1-mediated fold change
(log2 ratio, x-axis), conducted for each replicate pair (Exp A, B and C;
Supplementary Table 2a) and also for combined results (Exp(A+B+C)).
c, Kolmogorov-Smirnov test results for b, relative to total. d, Functional
annotation results for downregulated DEG with miR-1 7oxo site in Fig. 5b,
performed by DAVID (https://david.ncifcrf.gov) using BIOCARTA and KEGG
pathways (top left). The most significantly enriched pathway, “skeletal muscle
hypertrophy regulated via AKT/mTOR pathway” (P = 0.02) in BIOCARTA, was
indicated with other components of hits in the table (red; bottom left) and
pathway map (red star; right). GSK3B is highlighted with red dotted circle.
Notably, activation of GSK3B has been reported to suppress cardiac
hypertrophy’. e-g, Concentrating on heart-related pathways in the Kyoto
Encyclopedia of Genes and Genomes (KEGG) database, the normalized
enrichment score (NES) from gene set enrichment analysis (GSEA) identified
“hypertrophic cardiomyopathy” (HCM, NES = 1.25; e, right), “arrhythmogenic
right ventricular cardiomyopathy” (ARVC, NES = 1.20; f, right), and “dilated
cardiomyopathy” (DCM, NES = 1.25; g, top) as pathways regulated by
7o8G-miR-1. Heat maps on the top indicate the mRNAs in the corresponding pathway
with measured expression; red, downregulation; blue, upregulation; order of
results from top, NT (n = 3), 7o8G-miR-1 (n = 3). Hits from downregulated DEG
with miR-1 7oxo site in Fig. 5b were also displayed as red stars in HCM (e, left),
ARVC (f, left) and DCM (only a distinct part from HCM and ARVC is displayed;
g, bottom). ATP2A2 (SERCA2) is highlighted with red dotted circle and
GJA1 (CX43) with blue dotted circle.
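
The cumulative distribution comparisons in panels a-c of Extended Data Fig. 7 rest on the two-sample Kolmogorov-Smirnov statistic, which is the largest vertical gap between two empirical CDFs. The following sketch computes only that statistic for two lists of log2 fold changes. The function name `ks_statistic` and the toy values are assumptions made here, not the authors' code; a P value would normally be taken from a statistics library (for example `scipy.stats.ks_2samp`) rather than from this sketch.

```python
from bisect import bisect_right


def ks_statistic(sample_a, sample_b):
    """Two-sample Kolmogorov-Smirnov statistic: max |F_a(x) - F_b(x)|."""
    if not sample_a or not sample_b:
        raise ValueError("both samples must be non-empty")
    a = sorted(sample_a)
    b = sorted(sample_b)
    largest_gap = 0.0
    # Empirical CDFs only change at observed values, so checking those suffices.
    for x in a + b:
        cdf_a = bisect_right(a, x) / len(a)
        cdf_b = bisect_right(b, x) / len(b)
        largest_gap = max(largest_gap, abs(cdf_a - cdf_b))
    return largest_gap


# Toy log2 fold changes, invented for illustration only.
with_site = [-0.9, -0.6, -0.4, -0.1]
all_mrnas = [-0.4, -0.1, 0.2, 0.5]

if __name__ == "__main__":
    print(f"KS statistic: {ks_statistic(with_site, all_mrnas):.2f}")
```

A leftward shift of the site-containing set, as in the toy values, corresponds to the downregulation that the CDF plots are meant to reveal.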
Extended Data Fig. 8 | Transcriptome-wide analysis of o8G-miR-1
transfected H9c2 cells. a-d, CDF analysis as performed in Fig. 5a except for
7o8G-miR-1-transfected H9c2 cells; all 7oxo sites (a), 7mer 7oxo site (positions
2-8; b), 6mer seed site (positions 2-7; c) and 7mer seed site (positions 2-8; d).
Kolmogorov-Smirnov test (two sided) results were displayed in the table
(b, c, d, top right table); mean. Control in c, d denotes mismatched site at
position 6 of miR-1 in 3’UTRs but no seed site in the transcript. e, Gene Ontology
analysis results of 7o8G-miR-1 targets identified in H9c2 cells (Fig. 5c); intensity
of bubble colour, P value (derived from DAVID); bubble size, frequency of the
Gene Ontology term. Highly similar Gene Ontology terms are linked by edges in
the graph, where the line width indicates the degree of similarity. Of note, only
Gene Ontology terms in biological process were used in this analysis. f-h, CDF
analysis as in a-d except for 2o8G-miR-1 (f) or 3o8G-miR-1 (h) by using RNA-seq
results (Supplementary Table 2c, d); Kolmogorov-Smirnov test (two sided)
results in h. i, j, Gene Ontology analysis results as performed in e except for
2o8G-miR-1 (i) and 3o8G-miR-1 (j); P < 0.05 (derived from DAVID), n = 176
(2o8G-miR-1), n = 34 (3o8G-miR-1). 2o8G-miR-1 targets and 3o8G-miR-1 targets
were selected as containing 6mer sites (position 2-9) in 3’UTR and showing
log2 ratio < -0.8 in f, g. The biggest cluster was displayed as representative (i, j).
k, Validation of putative 7o8G-miR-1 targets by qPCR in 7o8G-miR-1-transfected
rCMC as performed in Fig. 5d, f. As control, GAPDH (for FBXO32 and PTEN) or
ACTB (for USP18) was also measured for normalization. FBXO32, PTEN and
USP18, of which 3’UTRs contain 7oxo sites (Supplementary Table 4c), were
examined; t-test, two-sided; *P= 0.003, 5.4 x 10*,2.1x10* and 0.044,
respectively; n > 3 biologically independent samples; data are mean ± s.e.m.
Notably, PTEN also contains a miR-1 seed site in its 3’UTR. l, m, CDF analyses as
conducted in Fig. 5g except for all o8G-miR-1 targets (2oxo, 3oxo or 7oxo sites; l)
or only for 2o8G-miR-1 (m, left panel) or 3o8G-miR-1 targets (m, right). n, GSEA
results performed as in Fig. 5h except using 2o8G-miR-1 (top) and 3o8G-miR-1
(bottom) dependent changes of putative cognate mRNA targets (6mers,
3’UTR) in H9c2 cells (Supplementary Table 2c, d) for the enrichment of
ISO-dependent downregulation (log ratio < 0); NES, normalized enrichment
score; P value, adjusted P value derived from GSEA.
Extended Data Fig. 9 | Identification of 7o8G-miR-1 targets and their 7oxo
sites during adrenergic cardiac hypertrophy and cardiomyopathy.
a, Experimental validation of Bicd2 as 7o8G-miR-1 target (7oxo site) in
phenylephrine-treated rCMCs. Relative expression was measured by qPCR,
normalized by Actb; *P = 0.024; n = 3 biologically independent samples; error
bars, s.e.m. b, Same validation experiments as in a except using ISO-treated
mouse hearts for Bicd2 (left, normalized by Actb) and Atp2a2 (right, normalized
by Gapdh); *P = 0.005 and 0.06, respectively; n > 4 biologically independent
samples; error bars, s.e.m. c-j, miR-1 7oxo sites (red) in Ago bound regions
(3’UTR) of patients with heart disease for ATP2A2 (c), GSK3B (d), BICD2 (e),
GJA1 (f), GATA4 (g), DRAM1 (h), PTEN (i) and FBXO32 (j), represented as in
Fig. 6d; identified in the Ago-mRNA peaks (red; sequences in Supplementary
Tables 3 and 4) with significant peak height (P < 0.05 or 0.01, CLIPick”’, black);
with miR-1 seed site (orange). k, Luciferase reporter assays with different
concentrations of 7o8G-miR-1 for 7oxo sites, derived from GATA4, ATP2A2 and
BICD2. Half-maximal inhibitory concentration (IC50) was calculated. Of note,
repression mediated by miR-1 7oxo site in GATA4 (IC50 = 1.2 nM) or ATP2A2
(IC50 = 0.7 nM) was efficient, albeit the maximum repression was only ~15-20%
(left and middle). By contrast, miR-1 7oxo site in BICD2 exerted better maximum
repression (52%) with low efficiency (IC50 = 11 nM; right). l-n, CLEAR-CLIP
analysis for ISO-induced hypertrophic mouse hearts, yielding miRNA-target
chimaera reads (l). miR-1 containing chimaera reads were aligned and analysed
for the frequency of miR-1 seed, 20xo, 30xo, and 7oxo sites depending onISO 
treatment, examined by expanding the size of the sequences from the miR-1 
ligated site (tO, 25,50 or 100 nt; m); box plots with median line, first and third 
quartile; whiskers, minima and maxima (n). Although only limited number of 
chimaera reads were yielded for miR-1 (Supplementary Table 6a), global 
propensity of its target interaction could be analysed for their changes 
depending onISO-treatment. 0, Sequence of conserved miR-1 7oxo site in 
HSPB7, which overlapped with miR-1 chimaeras in ISO-treated mouse hearts 
and Ago-mRNA peaks of patients with cardiomyopathy (Fig. 6d). p, Validation 
of Hspb7as miR-17oxo target by performing qPCR in 70°G-miR-1 transfected 
rCMC (normalized by Gapdh, left) and ISO-injected mouse heart by qPCR 
(normalized by Actb, right).-, NT transfection (left) or mock (right); *P=0.04 
and 0.01, respectively; n =3 (left) or 5 (right) biologically independent samples; 
error bars, s.e.m.q, Ago-bound miR-1seed and 7oxo sites identified in the 
3’UTR of G/A1 in patients with cardiomyopathy, as represented in c-j; left. 
Activity of luciferase reporters, containing the indicated regions (left), were 
examined depending on the concentration of 70°G-miR-1 (right). Of note, 
despite containing the miR-1seed site, core-Aoxo completely lost repression by 
70°G-miR-1, implicating the importance of 7oxo site to execute miR-1 
dependent repression of G/AI. All Pvalues from two-sided t-test, *P<0.05;n=>3 
biological independent samples; data are mean+s.d. unless otherwise 
indicated. 
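
Panel k distinguishes how efficiently a site responds (IC50) from how strongly it can be repressed at saturation. The sketch below illustrates one simple way to read a half-maximal concentration from a dose-response series by interpolating on a log concentration axis. It is not the fitting procedure used in the study, which is not described here; the function name and the example values are hypothetical.

```python
import math


def half_max_concentration(concs_nm, repression):
    """Estimate the concentration giving half of the maximum observed repression.

    concs_nm   -- increasing, positive concentrations (nM)
    repression -- fractional repression at each concentration (0 = none)
    Interpolates linearly on a log10 concentration axis between the two
    measured points that bracket the half-maximal repression.
    """
    if len(concs_nm) != len(repression) or len(concs_nm) < 2:
        raise ValueError("need two or more paired points")
    if any(c <= 0 for c in concs_nm):
        raise ValueError("concentrations must be positive")
    half = max(repression) / 2.0
    points = list(zip(concs_nm, repression))
    for (c0, r0), (c1, r1) in zip(points, points[1:]):
        if r0 <= half <= r1 and r1 > r0:
            frac = (half - r0) / (r1 - r0)
            log_c = math.log10(c0) + frac * (math.log10(c1) - math.log10(c0))
            return 10 ** log_c
    raise ValueError("half-maximal repression is not bracketed by the data")


# Hypothetical dose-response series (not data from the paper).
example_ic50 = half_max_concentration([0.1, 1, 10, 100], [0.0, 0.1, 0.4, 0.5])
```

Because the half-maximal point is defined relative to each reporter's own maximum, a site with weak maximal repression can still have a low IC50, which is the pattern described for GATA4 and ATP2A2 versus BICD2.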
Extended Data Fig. 10 | Specific inhibition of 7o8G-miR-1 by competitive
inhibitors, anti-7oxo. a, Schematics of competitive miRNA inhibitors, which
consist of tandem repeats of miRNA target sites (7mer, positions 2–8) in an RNA
oligonucleotide; “Cont”, NT seed sites (2x); anti-seed, miR-1 seed sites (2x);
anti-7oxo, miR-1 7oxo sites (2x). Base pairing of miR-1 or 7o8G-miR-1 with its
cognate target site (seed or 7oxo site) is shown. b, c, Luciferase reporter
assays used to validate the activity and specificity of the competitive miRNA inhibitors
(anti-seed and anti-7oxo) in a. Luciferase reporter vectors containing miR-1 seed
sites or 7oxo sites were constructed and used to confirm derepression of the
reporters by anti-seed (b) and by anti-7oxo (c); “Cont”, anti-NT (containing seed
sites of NT); −, NT; *P=1.87 x10 and 5.08 x 10™”, respectively; n = 8 biologically
independent samples (b); *P = 0.038; n = 5 biologically independent samples
(c); error bars, s.e.m. Of note, anti-7oxo significantly derepressed 7oxo-site-mediated
target repression, although the extent was marginal. This is
possibly because the inhibitor contains only two repeats of the target site.
Notably, anti-7oxo showed no effect on seed-site-mediated target repression,
confirming the specificity of anti-7oxo (c). d, CDF analysis to globally confirm
derepression of 7oxo targets by anti-7oxo (9x); “7oxo site”, mRNAs with a 7oxo
site (7mer, positions 2–8, in 3′UTR); “No site”, mRNAs with neither a 7oxo site nor
a seed site (6mers, positions 2–8, in 3′UTR); “Cont”, transcripts with a
mismatch site at position 6 of miR-1 (negative control; 7mer, positions 2–8, in
3′UTR) but without a 7oxo (left) or seed site (right). Expressed transcripts
(FPKM > 1 in both samples, valid status from Cuffdiff, n = 7,958) from RNA-seq
(Supplementary Table 6b) were used for the analysis. Notably, anti-7oxo
(9x) significantly derepressed miR-1 7oxo sites (P = 0.038, relative to control;
Kolmogorov–Smirnov test, two-sided; left) but not miR-1 seed sites (P = 0.987,
relative to control; Kolmogorov–Smirnov test, two-sided; right). e, Time-lapse
images of rCMCs transfected with either cont (anti-NT (4x); left) or anti-7oxo
(4x; right) after phenylephrine treatment (with serum starvation); scale bar,
100 μm. Introduction of anti-7oxo to rCMCs attenuated phenylephrine-induced
hypertrophy. f, Generation of cardiomyocyte-specific transgenic mice
(TG) expressing anti-7oxo (13x); transgene cassette, generated from the
α-MHC::anti-7oxo (13x) plasmid by BamHI. g, Genotyping results of four
independent founders (F0; #65, #68, #69, #44). h, Mapping result of RNA-seq
reads (Supplementary Table 6c) for the anti-7oxo transgene cassette,
expressed in hearts of ISO-treated TG (F,, #68). Notably, TG(+) showed
sufficient expression of anti-7oxo (13x) sequences in heart tissue. i, CDF
analyses of RNA-seq results (Supplementary Table 6c) to confirm anti-7oxo
(13x)-mediated derepression of 7o8G-miR-1 targets (7oxo sites), as analysed in d.
Notably, significant derepression of 7o8G-miR-1 targets (as measured by log2
(TG(+)ISO/TG(−)ISO)) was observed (P = 0.021, relative to “Cont”,
Kolmogorov–Smirnov test, two-sided; P=4.1* 10°, relative to “no site”); by contrast, there
was no change for miR-1 seed targets (right). All P values from t-test, two-sided;
*P < 0.05; n ≥ 3 biologically independent samples; data are mean ± s.d. unless
otherwise indicated.
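
The CDF comparisons in panels d and i rest on the two-sample Kolmogorov–Smirnov statistic, which is the largest vertical gap between two empirical cumulative distributions of log2 fold changes. The following is a minimal, dependency-free sketch of that statistic; it is not the authors' analysis script, and the example values are hypothetical. A P value could be obtained from a statistics library such as SciPy (`scipy.stats.ks_2samp`).

```python
def ks_statistic(sample_a, sample_b):
    """Two-sample Kolmogorov-Smirnov statistic (max gap between empirical CDFs)."""
    a, b = sorted(sample_a), sorted(sample_b)
    if not a or not b:
        raise ValueError("both samples must be non-empty")
    i = j = 0
    d = 0.0
    while i < len(a) and j < len(b):
        x = min(a[i], b[j])
        # Advance both pointers past every value <= x so ties are handled together.
        while i < len(a) and a[i] <= x:
            i += 1
        while j < len(b) and b[j] <= x:
            j += 1
        d = max(d, abs(i / len(a) - j / len(b)))
    return d


# Hypothetical log2 fold changes for transcripts with and without a 7oxo site.
with_site = [0.30, 0.10, 0.25, 0.40]
control = [-0.10, 0.00, 0.05, -0.20]
gap = ks_statistic(with_site, control)
```

A rightward shift of the "7oxo site" curve relative to the control curve corresponds to derepression, and the statistic grows as the two curves separate.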
Extended Data Fig. 11 | Loss-of-function study of 7o8G-miR-1 in vivo by
establishing an anti-7oxo transgenic mouse. a, The ISO treatment schedule
for anti-7oxo TG, cardiomyocyte-specific transgenic mice with expression of
anti-7oxo (a specific sponge inhibitor against 7o8G-miR-1); intraperitoneal (IP)
injection. b, Heart size of anti-7oxo TG(-) versus TG(+) in the presence of ISO 
administration (three independent F, lines, #69, #68, #44; left); HW/BW; error 
bars, s.d.;*P=0.014; n> 3 (right). c, Genotyping results of F,, generated from 
two different F, lines (#71, top; #77, bottom). d, Section of hearts from 
littermates of two different lines (#71, left; #77, right); TG(-) versus TG(+) under 
ISO treatment. e, Heart size in d was quantified as HW/TL (bottom) or HW/BW 
(top); box plots with median line, first and third quartile; whiskers, minima and 
maxima; x, mean; *P=0.007 and 0.013. Of note, ISO-induced hypertrophy is 
prevented in anti-7oxo TG (F,). f, H&E stained tissue sections of interventricular 
septum (IS) of anti-7oxo TG (F)); n=3; each experiment was repeated 
independently with similar results. g, Immunofluorescence staining of 
cardiomyocytes in IS of anti-7oxo TG (Fo); wheat germ agglutinin (WGA) for
refining the cell border; MF20 for cardiomyocytes; DAPI for nuclear staining;
scale bar, 100 μm. h, Relative cell size of MF20(+) cells in g was quantified (n = 200,
ImageJ; *P=3.2 x10~”); box plots with median line, first and third quartile;
whiskers, minima and maxima; “x”, mean. Notably, the size of cardiomyocytes
was maintained in anti-7oxo TG even in the presence of ISO treatment.

i, A schematic model of the position-specific oxidation of miR-1; 7o8G-miR-1
targets identified in the adrenergic receptor (AR) pathway, red; others, blue.
j, Quantification of o8G-miR-1 in ISO-injected mice in a time-dependent manner.
After IP injection of ISO, hearts were collected at different time points (0, 1, 5
and 7 days; top), from which small RNA was used to measure the amount of
o8G-miR-1 relative to the o8G spike-in control by performing o8G IP and qPCR (miR-1;
bottom). Notably, sustained oxidation of miR-1 was observed up to 7 days after
ISO treatment; *P = 0.01, 0.03 and 0.01, respectively; n = 4 biologically
independent mice; error bars, s.e.m. k, Sequence conservation of o8G-miR-1
target sites in the 3′UTR. Conservation rates for miR-1 oxo sites (2oxo, 3oxo and
7oxo sites), including the seed site, are indicated in the cumulative graph (left) and
distribution (right) of seed sites of conserved miRNAs (conserved; blue line)
and all heptamers (total; black line). Notably, sequences of miR-1 2oxo, 3oxo
and 7oxo sites were evolutionarily conserved in the 3′UTR (11.3, 13.0 and 12.3%,
respectively) as much as seed sites of conserved miRNAs in mammals: higher
than the median conservation rate (10.5%, 6mers) but lower than the miR-1 seed site
(14.5%). All P values from two-sided t-test; n = 3 biologically
independent samples; data show the mean unless otherwise indicated.
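
The seed, 2oxo, 3oxo and 7oxo sites referred to in these legends are 7mer matches to miRNA positions 2–8. The sketch below shows how such site sequences could be derived and counted in a 3′UTR. It rests on two assumptions that are not stated in this section: the mature miR-1 sequence used here, and the rule that an oxidized guanine (o8G) pairs with adenine rather than cytosine. Function names are hypothetical.

```python
# Assumption: mature miR-1 sequence (5'->3'); not given in the legends above.
MIR1 = "UGGAAUGUAAAGAAGUAUGUAU"
WATSON_CRICK = {"A": "U", "U": "A", "G": "C", "C": "G"}


def target_site(mirna, oxo_position=None, start=2, end=8):
    """Return the target site (5'->3', RNA) pairing with miRNA positions start..end.

    If oxo_position is given, the G at that 1-based miRNA position is treated
    as o8G and is assumed to pair with A instead of C.
    """
    site = []
    for pos in range(end, start - 1, -1):  # the target is read antiparallel
        base = mirna[pos - 1]
        if pos == oxo_position:
            if base != "G":
                raise ValueError("oxo_position must point at a G")
            site.append("A")
        else:
            site.append(WATSON_CRICK[base])
    return "".join(site)


def count_sites(utr, site):
    """Count (possibly overlapping) occurrences of a site in a UTR (DNA or RNA)."""
    utr = utr.upper().replace("T", "U")
    width = len(site)
    return sum(1 for i in range(len(utr) - width + 1) if utr[i:i + width] == site)


seed_site = target_site(MIR1)                  # canonical seed match
oxo7_site = target_site(MIR1, oxo_position=7)  # 7oxo site under the o8G:A assumption
```

Under these assumptions the seed and 7oxo sites differ by a single nucleotide, so the two classes of targets can be separated by exact string matching.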
The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 


Only common tests should be described solely by name; describe more complex techniques in the Methods section. 
For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings
Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated
Data collection Muse 1.5 software (FACS data collection and analysis)


BD Accuri C6 Plus 1.0.23.1 software (FACS data collection and analysis)

Attune NxT software version 3.1 (FACS data collection and analysis) 

Leica Application Suite V3.3 (for fluorescence image acquisition and analysis) 

iBright FL100 Imaging System (dot blot) 

GloMax-Multi detection system software version 1.3.2 (Luciferase reporter assays) 

SRA toolkit (ver. 2.8.2) was used for downloading raw sequencing files from SRA database. 


Data analysis ImageJ (https://imagej.nih.gov/ij/) 


Bowtie with allowance of 2 mismatches (bowtie --norc -l 7 -n 2 -a -f), where the collapsed sequencing reads were indexed (using the
bowtie-build indexer with default parameters).
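
For readers who want to reproduce the alignment call, the sketch below assembles the quoted Bowtie 1 flags into an argument list suitable for `subprocess.run`. The index prefix and query file name are hypothetical placeholders, and the assumption that the index is built from the collapsed reads while the miRNA sequences are supplied as FASTA queries is an interpretation of the sentence above, not a confirmed detail.

```python
import shlex


def bowtie_command(index_prefix, query_fasta, seed_length=7, max_mismatches=2):
    """Assemble a Bowtie 1 argument list mirroring the flags quoted above."""
    return [
        "bowtie",
        "--norc",                   # do not align to the reverse-complement strand
        "-l", str(seed_length),     # seed length
        "-n", str(max_mismatches),  # mismatches allowed in the seed
        "-a",                       # report all valid alignments
        "-f",                       # query input is FASTA
        index_prefix,               # hypothetical index name
        query_fasta,                # hypothetical query file
    ]


cmd = bowtie_command("collapsed_reads_index", "mirna_queries.fa")
print(shlex.join(cmd))
```

Passing the list directly to `subprocess.run(cmd, check=True)` avoids shell quoting issues; the printed string is only for logging.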

Treeview (http://rana.lbl.gov/EisenSoftware.htm) for heat-map analyses

ImageJ (https://imagej.nih.gov/) for image quantitation 

RNA-Seq analysis was performed by using Tuxedo Suite; TopHat2 (http://ccb.jhu.edu/software/tophat), Cufflinks and Cuffdiff
(http://cole-trapnell-lab.github.io/cufflinks/).

Bowtie (http://bowtie-bio.sourceforge.net) for mapping sequencing reads to miRNAs 

CLIPick (http://clip.korea.ac.kr/clipick/) for Peak analysis of CLIP data 

GO analysis was performed by using DAVID (http://david.abcc.ncifcrf.gov/) with default parameters unless otherwise indicated, visualized 
with REVIGO (http://revigo.irb.hr/). 

Enrichment of functional annotation was analyzed by using GSEA (http://software.broadinstitute.org/gsea/). 

Python scripts for o8G-miSeq and Spike-in (http://clip.korea.ac.kr/oxog/). 
Raw sequencing data were deposited to SRA; SRP189806, SRP189807, SRP189808, SRP189813, SRP189117, SRP189812, SRP189811, SRP189809, SRP189810,
SRP213998, SRP214400, SRP226125, SRP228274
Life sciences study design


All studies must disclose on these points even when the disclosure is negative. 


Sample size No power analyses were used to predetermine sample sizes. However, sample sizes were chosen based on prior studies using similar 
experimental procedures. For most qPCR analyses we analyzed 3 sets of biological samples. The variability between the three biological 
replicates was very minimal and therefore did not require increasing the sample size. 


Data exclusions No data were excluded from the analyses 

Replication For in vivo experiments, biological replicates as well as independent cohorts of mice were used. 
For in vitro experiments, technical triplicates as well as independent experiments on separate days were performed to ensure reproducibility. 
All other data were replicated in at least two independent experiments as stated in figure legends 


All attempts at replication were successful. 


Randomization No randomization was used in this study. For in vitro experiments, the indicated cells were transfected with the indicated plasmids or RNAs, or
treated with the indicated drugs. For in vivo mouse experiments, mice were grouped according to the drug treatment


Blinding There was no blinding to group allocation. However, the staff who performed drug or RNA injections into mice were unaware of the expected outcome.


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 
n/a | Involved in the study n/a | Involved in the study 
Antibodies ChIP-seq 
Eukaryotic cell lines Flow cytometry 
Palaeontology MRI-based neuroimaging 


Animals and other organisms 


[| Human research participants 


[| Clinical data 


Antibodies 


Antibodies used Anti-o8G antibody (15A3), QED Bioscience, Cat# 12501, lot# 102411
(1:1000 for immunofluorescence staining, 1:2000 for dot or Northwestern blotting, 1.25-1.78 μg/RNA (μg) for
immunoprecipitation)
Anti-Ago2 antibody, Abcam (ab5072), Cat# ab32381, Lot# GR71413-1
(1:200 for immunofluorescence staining)
Anti-Ago2 antibody (2A8), DIAGENODE, Cat# C15200167-100, Lot# 003
(10 μg for CLEAR-CLIP)
Anti-Ago2 antibody (2E12), Abnova, Cat# H00027161-K; Lot# H5221-S2
(10 μg for CLEAR-CLIP)
Anti MYH1E antibody, Developmental Studies Hybridoma Bank, Cat# MF20, Lot#:12/7/17
(1:1000 for immunofluorescence staining)
(1:1000 for Immunofluorescence staining) 


Validation Anti-o8G antibody (15A3); eukaryotic; ELISA, IHC, AP, IP; PLoS One 3, e2849, 1183 (2008): J Biol Chem 280, 20978-20986, (2005):
FASEB J 21, 2753-2764, (2007): Mol Cell 59, 50-61, (2015); IF, IB, IP data were provided in the current manuscript. 
Anti-Ago2 antibody (Abcam); ChIP grade; mouse, rat, human; IP, IHC-P, ChIP, RIP, WB, ICC/IF; Nature 565:500-504 (2019): Nat 
Struct Mol Biol 25, 1019-1027 (2018); IF data were provided in the current manuscript 
Anti-Ago2 antibody (2A8); human, mouse, other (wide range); IP, HITS-CLIP, WB, IF, IHC; Nucleic Acids Res 42, 4629-4639 
(2014); HITS-CLIP data were provided in the current manuscript. 
Anti-Ago2 antibody (2E12); ChIP grade; human, mouse, rat; IB, IP; Nature 465, 584-9 (2010); HITS-CLIP data were provided in the 
current manuscript 
Anti MYH1E antibody (MF20); human, mouse, rat, chicken; ELISA, FACS, FFPE, IF, IC, IP, IB; J Biol Chem. 283, 26484-9 (2008): 
Development, 141: 4320-4331 (2014): Scientific reports 8, 2678 (2018); IF data were provided in the current manuscript 
Eukaryotic cell lines


Policy information about cell lines 


Cell line source(s) H9c2: ATCC (H9c2(2-1); CRL-1446) or Korean Cell Line Bank 
AC16: gift from Drs. J. Han (purchased from Millipore; scc109, terminated product) or H.-W. Lee and W.J. Park (originated as
gift from Dr. M.M Davidson; Columbia Univ.) 
HeLa: ATCC (CCL-2) or Korean Cell Line Bank 


Authentication H9c2 and HeLa were obtained from ATCC or Korean Cell Line Bank, whose cell lines have been thoroughly tested and
authenticated. For each cell line used here, we routinely checked its morphology, growth condition and any contaminants under the
microscope. General cell maintenance was at around 48 hr intervals, suggesting normal cell proliferation. Moreover, by performing
RNA-Seq frequently in each cell line, we have confirmed the identity of the cell lines, including the species of
origin. In this study, we also performed replicate experiments with multiple batches from different providers to see whether
the results were reproducible.


Mycoplasma contamination Confirm that all cell lines were tested as negative for mycoplasma contamination 


Commonly misidentified lines = None 
(See ICLAC register) 


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals C57BL/6 mouse, male & female, 8-12 weeks old 
Transgenic mice of C57BL/6 (α-MHC::miR-1, α-MHC::miR-1-7U, α-MHC::anti-7oxo), male & female, 5-12 weeks old
Sprague Dawley rat, male & female, neonatal (day1) 


Wild animals This study did not involve wild animals.
Field-collected samples The study did not involve samples collected from the field. 
Ethics oversight Korea University Institutional Animal Care and Use Committee (IACUC)


Note that full information on the approval of the study protocol must also be provided in the manuscript. 


Flow Cytometry 


Plots 


Confirm that: 


The axis labels state the marker and fluorochrome used (e.g. CD4-FITC). 


The axis scales are clearly visible. Include numbers along axes only for bottom left plot of group (a 'group' is an analysis of identical markers). 


All plots are contour plots with outliers or pseudocolor plots. 


A numerical value for number of cells or percentage (with statistics) is provided. 


Methodology 


Sample preparation Fluorescence protein reporter-transfected H9c2 cells or drug-treated H9c2 cells were collected by treating with 1x trypsin-EDTA for 3
minutes and neutralizing with DMEM with 10% FBS. Collected cells were resuspended in 1x PBS (Biosesang) with 4% BSA (Bovogen)
and 5mM EDTA (Sigma) and analyzed by flow cytometry.

For the Muse Cell Analyzer, H9c2 and AC16 cells were prepared according to the method suggested by the manufacturer.


Instrument BD Accuri C6 Plus (BD Biosciences, equipped with blue (488 nm) laser),
Attune NxT flow cytometer (Invitrogen, equipped with blue(488nm) and yellow(561nm) laser) 
Muse Cell Analyzer (Millipore) 


Software BD Accuri C6 Plus software (data collection and analysis)
Attune NxT software (data collection and analysis) 
Muse Software (data collection and analysis) 
NovoExpress software (ACEA Biosciences; for data analysis and conversion to csv), Excel 


Cell population abundance Cells (n=10,000) were examined. Flow cytometry was used for quantification purposes only (no postsorting fractions were 
collected) 


Gating strategy For all experiments, FSC-A/SSC-A gates of the cell population were used to discriminate viable cells from debris. Singlet and
doublet cells were discriminated using FSC-A/FSC-H gating. Psicheck2 vector (no fluorescence protein expressed) was used to
distinguish autofluorescence from expressed fluorescence proteins. Then, scatter plots of GFP vs. RFP signals (n=10,000 cells)
were analyzed. An example of the gating strategy is presented in Extended Data Fig. 4i-k.
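
The singlet/doublet step of the gating strategy can be illustrated with a small filter on the ratio of forward-scatter height to area. This is only a sketch of the general idea: the thresholds, the dictionary layout of an event and the function name are hypothetical, and real gates are normally drawn interactively in the cytometer software listed above.

```python
def gate_singlets(events, min_ratio=0.8, max_ratio=1.2):
    """Keep events whose FSC-H / FSC-A ratio falls inside a singlet window.

    events -- list of dicts with 'FSC-A' and 'FSC-H' keys (hypothetical layout)
    The ratio bounds are illustrative; doublets tend to have a larger area
    for a given height and therefore a lower ratio.
    """
    kept = []
    for event in events:
        area, height = event["FSC-A"], event["FSC-H"]
        if area <= 0:
            continue  # skip events without a usable area signal
        if min_ratio <= height / area <= max_ratio:
            kept.append(event)
    return kept


# Hypothetical events: one singlet-like, one doublet-like, one invalid.
example_events = [
    {"FSC-A": 1000.0, "FSC-H": 1000.0},
    {"FSC-A": 2000.0, "FSC-H": 1000.0},
    {"FSC-A": 0.0, "FSC-H": 500.0},
]
singlets = gate_singlets(example_events)
```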


The histone deacetylases (HDACs) are a superfamily of chromatin-modifying 
enzymes that silence transcription through the modification of histones. Among 
them, HDAC3 is unique in that interaction with nuclear receptor corepressors 1 and 2 
(NCoR1/2) is required to engage its catalytic activity. However, global loss of HDAC3
also results in the repression of transcription, the mechanism of which is currently
unclear. Here we report that, during the activation of macrophages by
lipopolysaccharides, HDAC3 is recruited to activating transcription factor 2
(ATF2)-bound sites without NCoR1/2 and activates the expression of inflammatory
genes through a non-canonical mechanism. By contrast, the deacetylase activity of
HDAC3 is selectively engaged at ATF3-bound sites that suppress Toll-like receptor 
signalling. Loss of HDAC3 in macrophages safeguards mice from lethal exposure to 
lipopolysaccharides, but this protection is not conferred upon genetic or 
pharmacological abolition of the catalytic activity of HDAC3. Our findings show that 
HDAC3 is a dichotomous transcriptional activator and repressor, with a non-canonical 
deacetylase-independent function that is vital for the innate immune system. 


In contrast to the lethality conferred by the global loss of HDAC3, 
genetic inactivation of its enzymatic activity by disruption of the 
deacetylase-activating domain (DAD) of NCoR1 and NCoR2 is compatible
with life with no overt physiological abnormalities. Such evidence
highlights the importance of the non-enzymatic functions of HDAC3,
which remain poorly characterized and have critical implications for the
development of deacetylase-targeting drugs such as suberoylanilide
hydroxamic acid (SAHA), an HDAC inhibitor that has been approved
by the Food and Drug Administration for the treatment of cancer and
is being tested for efficacy in other diseases. Loss of HDAC3 diminishes
the Toll-like receptor 4 (TLR4)-mediated hyperinflammatory
response to lipopolysaccharide (LPS) in macrophages owing to the
impaired activation of pro-inflammatory genes; however, the mechanism
is unknown. Here we identify a non-canonical mechanism in which
HDAC3 switches between its two enzymatic states to contribute to both
repression and activation of gene transcription in a single cell type.

To determine whether HDAC3 has functions that are independent 
of its deacetylase activity, we used mouse bone-marrow-derived macrophages
(BMDM) with myeloid-specific loss of HDAC3 (MHD3KO) that
were cultured in vitro after retrovirally mediated expression of an empty
control vector, the catalytically dead HDAC3(Y298F) mutant (containing
a tyrosine-to-phenylalanine substitution at residue 298) or wild-type
HDAC3 (WT-rescue). The expression of HDAC3(Y298F) was titrated
such that amounts of both the mRNA (Fig. 1a) and the protein (Fig. 1b)
were comparable to those of endogenous HDAC3. The deacetylase activity
of HDAC3 immunoprecipitated from HDAC3(Y298F)-expressing
cells was markedly reduced, to a level indistinguishable from that of
MHD3KO macrophages (Fig. 1c).

We next investigated the role of the catalytic activity of HDAC3 in the
transcriptional response to acute LPS challenge in vitro. As expected,
LPS induced substantial changes in control macrophages, including
the upregulation of classic inflammatory markers such as Il6, Il1a, Il1b
and Nos2. Approximately one-third of the genes that were differentially
expressed after LPS stimulation (762/2,152) were not differentially
expressed in the MHD3KO macrophages (Fig. 1d, Extended Data Fig. 1a–c).
The transcriptome of MHD3KO macrophages is shifted towards an alternatively
activated phenotype, which could affect the response to LPS.
Nevertheless, expression of more than half of the HDAC3-dependent, 
LPS-responsive genes (385/762) was rescued by the HDAC3(Y298F) 
mutant (referred to hereafter as deacetylase (DA)-independent genes), 
and these genes highly overlapped with those expressed in control and 
WT-rescue macrophages under LPS stimulation (Extended Data Fig. 1d). 

We focused in particular on the 174 genes that were upregulated by LPS
in a manner that was dependent on HDAC3 but not on its enzymatic activity
(Fig. 1e). The profile included pro-inflammatory genes, such as those
encoding cytokine production and activation of NF-κB signalling (Fig. 1f).
By contrast, expression of the 142 genes that were downregulated upon
LPS stimulation in an HDAC3-dependent manner was not rescued by the
HDAC3(Y298F) mutant (Extended Data Fig. 1e, f; referred to hereafter as
DA-dependent genes). These genes were functionally most notable for
downregulation of TLR signalling, including Tlr4 gene transcription
(Extended Data Fig. 1g), which suggests a potential negative feedback
mechanism in response to endotoxin. Overall, the HDAC3(Y298F)
mutant phenocopied the inflammatory transcriptional response to LPS 
stimulation of wild-type HDAC3, whereas the transcriptional response to 
the anti-inflammatory cytokine IL4 was enzyme-dependent (Extended 
Data Fig. 1h), indicating that the enzyme-independent activity of HDAC3 
was highly specific to the LPS response. 
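
The gene classes used in this analysis follow a set logic: a gene is HDAC3-dependent if it responds to LPS in control but not in MHD3KO macrophages, DA-independent if that response is restored by both HDAC3(Y298F) and wild-type HDAC3, and DA-dependent if it is restored by wild-type HDAC3 only. The sketch below expresses that logic with Python sets; the function name and the toy gene labels are hypothetical and are not data from the study.

```python
def classify_hdac3_genes(de_control, de_knockout, de_y298f, de_wt):
    """Split HDAC3-dependent LPS-responsive genes by deacetylase dependence.

    Each argument is the set of genes called differentially expressed after
    LPS in that condition (control, MHD3KO, KO+Y298F, KO+WT).
    """
    hdac3_dependent = de_control - de_knockout          # response lost without HDAC3
    da_independent = hdac3_dependent & de_y298f & de_wt  # rescued by both constructs
    da_dependent = (hdac3_dependent & de_wt) - de_y298f  # rescued by wild type only
    return da_independent, da_dependent


# Toy example with hypothetical gene labels.
da_indep, da_dep = classify_hdac3_genes(
    de_control={"g1", "g2", "g3", "g4"},
    de_knockout={"g4"},
    de_y298f={"g1", "g4"},
    de_wt={"g1", "g2", "g4"},
)
```

In this toy example g3 is HDAC3-dependent but rescued by neither construct, so it falls into neither class.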

To understand the mechanisms that underlie the DA-independent 
and DA-dependent transcriptomes, we performed chromatin 


‘Institute for Diabetes, Obesity, and Metabolism, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA, USA. “Division of Endocrinology, Diabetes, and Metabolism, 
Fig. 1 | HDAC3 activates LPS-stimulated inflammatory gene expression in a
DA-independent manner. a, Hdac3 mRNA levels in control, MHD3KO,
HDAC3(Y298F)-rescue (KO+Y298F) and WT-rescue (KO+WT) macrophages as
quantified by quantitative PCR (qPCR). Data are mean ± s.d. (n = 6 biological
replicates). P values calculated by one-way ANOVA. b, Western blot (performed
independently twice, n = 3 biological replicates) of HDAC3 protein levels in
control, MHD3KO, HDAC3(Y298F)-rescue and WT-rescue macrophages, with
vinculin as loading control. c, Deacetylase activity of HDAC3
immunoprecipitated from control, MHD3KO, HDAC3(Y298F)-rescue and
WT-rescue macrophages, as measured by the fluorescent signal of the
enzymatically cleaved substrate. Data are mean ± s.d. (n = 3 biological
replicates). P values calculated by one-way ANOVA. d, Scatter plot of RNA-seq
data showing transcripts-per-kilobase-million (TPM)-transformed average
read counts from vehicle (control) or LPS-treated control macrophages (n = 4
biological replicates, TPM > 0.1) and HDAC3-regulated LPS-responsive genes


immunoprecipitation followed by high-throughput sequencing (ChIP-seq) 
analyses for HDAC3 (Extended Data Fig. 2a, b). LPS stimulation substantially 
increased HDAC3 genomic occupancy within 100 kilobases (kb) of the transcription
start sites of previously identified DA-independent genes (Fig. 2a),
whereas HDAC3 binding was unchanged near DA-dependent genes (Fig. 2b). 

We next assessed the functionality of HDAC3 binding sites using two 
independent measures of enhancer activity, namely Global Run-On 
sequencing (GRO-seq) and acetylated histone H3 lysine 27 (H3K27Ac) 
ChIP-seq. Globally, neither loss of HDAC3 nor LPS stimulation perturbed
enhancer RNA (eRNA) production (Extended Data Fig. 2c) or
H3K27Ac levels (Extended Data Fig. 2d, e). However, at HDAC3 binding
sites near DA-independent genes, LPS stimulated a marked increase in 
eRNA production (Fig. 2c, Extended Data Fig. 2f) as well as H3K27Ac 
levels (Fig. 2d), which were attenuated in the MHD3KO macrophages. 
Conversely, enhancer activity (Fig. 2e, Extended Data Fig. 2g) and 
H3K27Ac abundance (Fig. 2f) proximal to DA-dependent genes were 
reduced by LPS stimulation and considerably induced upon loss of 
HDAC3; this is consistent with HDAC3 functioning as a repressor via 
histone deacetylation at these sites. Although functional enhancers 
might not always regulate the nearest genes, the transcriptional congruity
between HDAC3-regulated enhancers and HDAC3-regulated
genes supports a direct enhancer-promoter regulatory relationship.
(pink, upregulated; light blue, downregulated; differential expression (DE)
cutoff: |log2(fold change (FC))| > 1, two-sided Benjamini-Hochberg-adjusted
false discovery rate (FDR) < 0.05, as determined by edgeR likelihood ratio test).
DA-independent (red hollow circle) are genes for which LPS-stimulated 
changes were rescued by HDAC3(Y298F) and wild-type HDAC3. 
Deacetylase-dependent (blue hollow circle) are genes for which LPS-stimulated 
changes were rescued by wild-type HDAC3 but not by HDAC3(Y298F). e, Heat 
map showing 174 LPS-upregulated, HDAC3-dependent differentially expressed 
genes that were rescued by HDAC3(Y298F) and wild-type HDAC3 
(DA-independent genes: n = 4 biological replicates, DE cutoff: |log2FC| > 1,
two-sided Benjamini-Hochberg-adjusted FDR < 0.05 as determined by edgeR
likelihood ratio test). f, Gene ontology analysis of 174 LPS-upregulated
DA-independent genes (n = 4 biological replicates). q values represent
Benjamini-Hochberg-adjusted one-sided hypergeometric P values for 
over-representation as determined by Enrichr. 
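
The differential-expression cutoff quoted in this legend combines a fold-change threshold with a Benjamini-Hochberg-adjusted false discovery rate. The sketch below shows the standard Benjamini-Hochberg adjustment and the resulting call in plain Python. It is an illustration of the criterion only; the study used edgeR for the underlying tests, and the function names and example P values here are hypothetical.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg-adjusted P values in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


def is_differentially_expressed(log2_fc, fdr, fc_cutoff=1.0, fdr_cutoff=0.05):
    """Apply the |log2FC| > 1 and FDR < 0.05 criterion from the legend."""
    return abs(log2_fc) > fc_cutoff and fdr < fdr_cutoff


# Hypothetical raw P values for four genes.
fdr_values = benjamini_hochberg([0.01, 0.04, 0.03, 0.005])
```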


Because HDAC3 does not have intrinsic DNA-binding capability, we 
considered whether the selective engagement of its catalytic activity 
was determined by transcription factors that facilitate genomic recruitment.
De novo motif exploration of HDAC3-bound LPS-stimulated
enhancers revealed the canonical motifs bound by the macrophage
lineage determinant PU.1 and by NF-κB-p65, as expected (Extended Data
Fig. 2h). Application of a transcription factor binding analysis (TBA)
based on machine learning to DA-independent and DA-dependent
HDAC3-bound genomic regions revealed that DNA-sequence features
intrinsic to these distinct genomic regions could predict different
outcomes (Extended Data Fig. 3a). Specifically, TBA identified
ATF2 as the most statistically significant positive cobinding partner at
DA-independent sites, whereas ATF3 was predicted as the transcription
factor most likely to be responsible for the outcome of DA-dependent
genes (Fig. 3a). Notably, ATF2 was predicted to be negatively correlated
with HDAC3 binding at DA-dependent sites, whereas ATF3 might
negatively influence the binding of HDAC3 at DA-independent sites
(Fig. 3a); other members of the AP-1 family did not display such robust
differential binding affinity (Fig. 3b, Extended Data Fig. 3b, c). Moreover,
HDAC3-bound eRNAs were more abundantly transcribed when
the region contained the ATF2 motif than when it contained the ATF3 
motif (Extended Data Fig. 3d). 
Fig. 2 | Differential recruitment and transcriptional functions of HDAC3 at
DA-independent and DA-dependent LPS-responsive genes. a, b, Average
density profiles in reads per million (RPM) of HDAC3 ChIP-seq data
(mean ± s.e.m.; n = 3 biological replicates) at enhancers near DA-independent
genes (two-sided Wilcoxon’s P= 2.6 x 10) (a) or DA-dependent genes 
(two-sided Wilcoxon’s P= 0.084) in vehicle or LPS-treated macrophages (b). 
c, d, Average density profiles in RPM (mean ± s.e.m.; n = 3 biological replicates)
of bidirectional eRNA transcription measured by GRO-seq (control LPS-treated 
versus MHD3KO LPS-treated; two-sided Wilcoxon’s P= 7.7 x 102°) (c) or 
H3K27Ac levels measured by ChIP-seq (control LPS-treated versus MHD3KO 
LPS-treated; two-sided Wilcoxon’s P= 4.2 x 10’) (d) at HDAC3-bound sites near 
DA-independent genes in control and MHD3KO macrophages, with or without 
LPS. e, f, Average density profiles in RPM (mean ± s.e.m.; n = 3 biological
replicates) of bidirectional eRNA transcription measured by GRO-seq (control
LPS-treated versus MHD3KO LPS-treated; two-sided Wilcoxon’s P= 3.2 x 10 !”°)
(e) or H3K27Ac levels measured by ChIP-seq (control LPS-treated versus
MHD3KO LPS-treated; two-sided Wilcoxon’s P= 6.4 x 10”) (f) at HDAC3-bound
sites near DA-dependent genes in control and MHD3KO macrophages, with or
without LPS. DA-independent sites = 172, DA-dependent sites = 141.
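
The density profiles in this figure are expressed in reads per million (RPM) and averaged over replicates. As a minimal sketch of that normalization, assuming per-bin read counts around each site and a known total of mapped reads per library (both hypothetical inputs, as are the function names), the calculation could look like this:

```python
def reads_per_million(bin_counts, total_mapped_reads):
    """Scale per-bin read counts to reads per million mapped reads."""
    if total_mapped_reads <= 0:
        raise ValueError("total_mapped_reads must be positive")
    scale = 1_000_000 / total_mapped_reads
    return [count * scale for count in bin_counts]


def mean_profile(replicate_profiles):
    """Average several equally binned profiles position by position."""
    return [sum(column) / len(column) for column in zip(*replicate_profiles)]


# Hypothetical counts for two replicates with different sequencing depths.
rep1 = reads_per_million([2, 4], total_mapped_reads=2_000_000)
rep2 = reads_per_million([6, 8], total_mapped_reads=2_000_000)
average = mean_profile([rep1, rep2])
```

Normalizing each library before averaging keeps a deeply sequenced replicate from dominating the mean profile.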


To test whether these predicted motifs were sufficient to differentially
determine the engagement of the catalytic activity of HDAC3 for
transcription, we transfected macrophages with reporter genes consisting
of enhancer elements upstream of either Clec2d (a DA-independent
gene containing the ATF2 motif) (Extended Data Fig. 3e) or Gas6 (a
DA-dependent gene containing the ATF3 motif) (Extended Data Fig. 3f). LPS
markedly stimulated the transcriptional activity of the ATF2 reporter in an
HDAC3-dependent manner (which was rescued by the HDAC3(Y298F)
mutant) only in the presence of endogenous ATF2, and independent of
ATF3 (Extended Data Fig. 3g). Moreover, mutating the ATF2 motif to the
ATF3 motif abrogated the LPS-stimulated induction of luciferase transcription
(Extended Data Fig. 3g). By contrast, the ATF3 reporter had strong
basal activity that was repressed by LPS in an HDAC3-dependent manner
and was unaffected by depletion of ATF2; this repression was rescued only
by catalytically active wild-type HDAC3 (Extended Data Fig. 3h). However,
loss of either ATF3 or the ATF3 binding motif de-repressed luciferase
transcription, phenocopying the loss of HDAC3 (Extended Data Fig. 3h).

These results predicted that ATF2 would be more abundant at 
DA-independent HDAC3-regulated sites whereas ATF3 would be prefer- 
entially associated with enzyme-active HDAC3, and this was confirmed 
by comparison of their respective cistromes (Extended Data Fig. 4a). 
ATF2 was more abundant at DA-independent HDAC3-regulated sites in 
LPS-treated macrophages (Fig. 3c), whereas ATF3 was more enriched 
at HDAC3-bound sites near DA-dependent genes (Fig. 3d). Depletion of 
ATF2 or ATF3 with multiple independent short interfering RNAs (siR- 
NAs) (Extended Data Fig. 4b, c) did not induce significant transcriptional 
changes at the baseline (Extended Data Fig. 4d). However, loss of ATF2,
but not ATF3, reduced HDAC3 recruitment (Fig. 3e, Extended Data
Fig. 4e) and specifically abolished HDAC3 DA-independent transcrip- 
tional activation of LPS-induced genes (Fig. 3f). By contrast, loss of 
ATF3, but not ATF2, specifically diminished HDAC3 recruitment to
DA-dependent enhancers (Fig. 3g, Extended Data Fig. 4f), resulting 
in de-repression of LPS-downregulated genes (Fig. 3h). These results 
suggest that differences in the enrichment of ATF2 and ATF3 genomic 
occupancy determine the dichotomous functions of HDAC3 transcrip- 
tional regulation in response to LPS. 

To gain insight into the mechanism of ATF2-mediated transcrip- 
tional activation by HDAC3, we performed cistromic profiling for 
the transcription factor p65. p65 has previously been implicated in 
HDAC3-dependent LPS signalling’*”°, and its motif is enriched at 
HDAC3-bound enhancers. LPS induced genomic binding of p65 at 
DA-independent sites (Extended Data Fig. 4g) more robustly than at 
DA-dependent sites (Extended Data Fig. 4h). p65 binding was enhanced
by the loss of HDAC3 only at DA-dependent sites (Extended Data 
Fig. 4h), which is consistent with literature reports showing that p65 
is a direct substrate of HDAC3”° ~ and that deacetylation triggers the 
nuclear export of p65”. p65 depletion did not alter HDAC3 genomic 
binding (Fig. 3e, g), but attenuated the induction of DA-independent 
genes (Fig. 3f) upon LPS stimulation without affecting DA-dependent 
genes (Fig. 3h); this suggests a specific role for p65 recruitment in 
DA-independent activation of pro-inflammatory genes. 

Because the catalytic activity of HDAC3 requires activation by 
co-repressor complexes, we next tested whether the selective require- 
ment of HDAC3 deacetylase functions could be explained by the differential
recruitment of NCoR1/2. Notably, genomic colocalization
of NCoR1/2 with HDAC3 was diminished at DA-independent enhanc- 
ers of genes activated by HDAC3 (Fig. 3i, Extended Data Fig. 4i) rela- 
tive to their robust recruitment to DA-dependent targets that were 
repressed by HDAC3 (Fig. 3j, Extended Data Fig. 4j). Quantitatively, 
NCoR1/2 were enriched at sites of ATF3 binding relative to sites of ATF2 
binding (Extended Data Fig. 4k—n), providing a plausible mechanism 
for the reported immunosuppressive functions of ATF3”*”%, including 
ATF3-dependent repression of Tlr4?>”’. Conversely, the depletion of
NCoR1/2 at ATF2- and HDAC3-binding sites contributes to the physiological
basis for DA-independent gene activation.

We next tested the importance of the DA-independent functions 
of HDAC3 in regulating acute systemic inflammation. A knock-in 
HDAC3(Y298F) mouse was not available; however, we used mice in 
which HDAC3 showed reduced catalytic activity owing to mutations 
in the deacetylase-activating domain (DAD) of NCoR1/2 that abolish 
their activation of HDAC3 (NSDAD mice’). A dose of LPS that we empiri- 
cally determined to be lethal to around 50% of wild-type C57BL/6 mice 
within 24-120 h (Extended Data Fig. 5a) was lethal to NSDAD mice within 
28 h of injection (Fig. 4a), similar to previous findings in NCoR1-DAD
mutant mice”. Correspondingly, circulating levels of pro-inflammatory 
cytokines IL6 and TNF were markedly increased in NSDAD mice 6 h after
LPS exposure (Extended Data Fig. 5b). By contrast, MHD3KO mice were 
largely protected from acute endotoxin lethality (Fig. 4a) and had lower 
levels of circulating pro-inflammatory cytokines compared with control 
mice (Extended Data Fig. 5b). Endogenous HDAC3 therefore contributes 
to LPS susceptibility, an effect that is magnified in the absence of its catalytic
activity; this indicates that the DA-independent and DA-dependent
functions of HDAC3 have opposing roles in the toxicity of LPS. 

To better understand the differential LPS sensitivities of MHD3KO 
and NSDAD mice, we evaluated the transcriptomes and HDAC3 cis- 
tromes of their peritoneal macrophages 6 h after endotoxin exposure 
in vivo. Gene expression analysis (Extended Data Fig. 5c) revealed that 
the in vivo DA-independent transcriptome was highly correlated with 
its in vitro counterpart (Extended Data Fig. 5d). Loss of HDAC3 abro- 
gated the activation of LPS-stimulated DA-independent genes in both 
control and NSDAD mice (Fig. 4b, Extended Data Fig. 5e), and these bore
Fig. 3 | Engagement of HDAC3 enzyme activity is determined by its
differential recruitment by ATF2 and ATF3. a, Machine-learning-based TBA
applied to HDAC3-bound 200-bp genomic regions (n = 3 biological replicates)
near DA-independent and DA-dependent genes. DNA motifs are ranked on the
basis of binding coefficient (negative or positive correlation with HDAC3 
binding) and significance of the motifs based on likelihood ratio test. 

b, Differential enrichment of TGACGTCA (ATF2 motif: 44/172 independent, 
0/141 dependent) or TGACTCA (ATF3 motif: 24/172 independent, 97/141 
dependent) motifs at HDAC3-bound sites near DA-independent and 
DA-dependent genes. c,d, Average density profiles in RPM of ATF2 (two-sided 
Wilcoxon’s P= 3.6 x 10”) (c) and ATF3 (two-sided Wilcoxon’s P=1.9 x10~°) (d) 
ChIP-seq data (mean ± s.e.m.; n = 3 biological replicates) at HDAC3-bound sites
near either DA-independent or DA-dependent genes in LPS-treated control


the signature of the pro-inflammatory response (Fig. 4c). By contrast, 
the catalytic activity of HDAC3 was required at in vivo DA-dependent 
genes (Extended Data Fig. 5f, g), where it exerted negative feedback 
on TLR4 signalling (Extended Data Fig. 5h). Moreover, as previously 
observed in vitro, LPS stimulated the genomic recruitment of HDAC3 
in vivo to sites near DA-independent genes (Extended Data Fig. 5i) but 
not DA-dependent genes (Extended Data Fig. 5j). Taken together, these 
results suggest that inhibition of the catalytic activity of HDAC3 might be 
detrimental, owing to persistent HDAC3-dependent activation of inflam- 
matory gene expression that must be reduced to enhance survival?’ #1. 

Because HDAC inhibitors are used in the clinic for several therapeutic 
indications, we compared the effects of pharmacological and genetic 
inhibition of the catalytic activity of HDAC3 on LPS toxicity. Consist- 
ent with previous reports of its anti-inflammatory benefits? **, low 
doses of SAHA (25-100 mg kg⁻¹) conferred a protective effect against
severe septic shock (Extended Data Fig. 6a). This is opposite to the 
phenotype of the NSDAD mice, in which HDAC3 is present but without 
deacetylase activity, and may reflect the fact that SAHA is a more potent 
inhibitor of other class I HDACs than of HDAC3*®. By contrast, daily 
administration of a higher, yet non-toxic dose of SAHA (400 mg kg⁻¹)
sensitized mice to LPS, similar to the phenotype of the NSDAD mice 
(Fig. 4d). MHD3KO mice subjected to the same treatment were pro- 
tected (Fig. 4d), suggesting that genetic or pharmacological inhibition 


macrophages. e, g, Average density profiles in RPM of HDAC3 ChIP-seq data
(mean ± s.e.m.; n = 2 biological replicates) for control (siC), ATF2-depleted
(siAtf2), ATF3-depleted (siAtf3) or p65-depleted (sip65; p65 is also known as
Rela) LPS-treated macrophages at HDAC3-bound sites near DA-independent (e)
or DA-dependent (g) genes. f, h, Heat map showing LPS-induced
transcriptional changes, as a function of log₂(FC) comparing LPS-treated and
vehicle control (n = 3 biological replicates), of DA-independent (f) or
DA-dependent (h) genes in control, MHD3KO, HDAC3(Y298F), ATF2-depleted
(siAtf2), ATF3-depleted (siAtf3) or p65-depleted (sip65) macrophages. i, j, Heat
maps of input, HDAC3, NCoR1 and NCoR2 ChIP-seq data showing average
signal (n = 3 biological replicates) in RPM at HDAC3-bound sites near
DA-independent (i) or DA-dependent (j) genes. DA-independent sites = 172,
DA-dependent sites = 141.


of the enzymatic activity of HDAC3 might sensitize mice to endotoxic 
shock through HDAC3-mediated hyperactivation of DA-independent 
pro-inflammatory mediators. Indeed, the transcriptome of
LPS-stimulated, SAHA-treated macrophages was similar to that of macrophages from NSDAD mice
exposed to LPS (Extended Data Fig. 6b). Moreover, this dose-dependent 
transcriptional activation was accompanied by LPS-stimulated HDAC3 
recruitment to DA-independent genes (Extended Data Fig. 6c, d) that 
did not occur at DA-dependent genes (Extended Data Fig. 6e, f). 
Although classically associated with transcriptional repression, a 
growing body of evidence suggests that HDAC3 might also function as 
a coactivator** *°°. Here we have demonstrated selective enzymatic
engagement of HDAC3 as a function of its differential association with
co-repressor complexes NCoR1 and NCoR2, selectively coordinated by
either ATF3 or ATF2 (Fig. 4e). Our results implicate HDAC3 as a critical
integrator of pathogenic stimuli to orchestrate well-balanced inflammatory
responses. Efforts to attenuate class I HDAC enzymatic activity for
immunomodulation should carefully consider the paradoxical intensification
of enzyme-independent, pro-inflammatory activities of HDAC3.
Fig. 4 | Dichotomous functions of HDAC3 orchestrate the inflammatory
response to endotoxin in vivo. a, Kaplan-Meier survival curves of control,
MHD3KO and NSDAD mice (n = 10 independent mice) subjected to 10 mg kg⁻¹
purified LPS by intraperitoneal injection. #P = 0.014, *P= 0.024, ****P=7.1x10™, 
calculated using a two-sided Mantel-Cox test. b, Heat map showing 179 
LPS-upregulated, HDAC3-dependent differentially expressed genes in vivo 
that were rescued by NSDAD (DA-independent genes: n=4 biological 
replicates, except for KO vehicle and NSDAD LPS with n=3 biological 
replicates, DE cutoff: |log₂FC| > 1, two-sided Benjamini-Hochberg-adjusted
FDR < 0.05 as determined by edgeR likelihood ratio test). c, Gene set
Methods


Animal studies 

MHD3KO mice® and NSDAD mice? were generated as previously 
described. Mice were housed in a temperature-controlled specific-
pathogen-free facility under 12-h light/dark cycles. Eight-week-old adult male
mice were used in all experiments. Age-matched wild-type C57BL/6
mice were purchased from Jackson Laboratories. No calculation was 
performed to pre-determine the sample size. Animal care and use 
procedures followed the guidelines of the Institutional Animal Care 
and Use Committee of the University of Pennsylvania in accordance 
with the guidelines of the National Institutes of Health. All experimental 
procedures were performed according to institutional pain class C 
protocol. 


In vivo endotoxic shock study 

All procedures were performed according to institutional pain class 
C protocol. Mice were intraperitoneally injected with 10 mg kg⁻¹
of purified LPS (Sigma Escherichia coli O111:B4) and closely observed
every 4 h for the first 48 h after endotoxin exposure,
for a total of 120 h. SAHA administration (Peprotech; 25-400 mg kg⁻¹
every 24 h, or 10% DMSO in PBS vehicle control) was started 1 day 
before LPS injection and continued daily until the experimental end 
point. Mice of different genetic backgrounds or treatments were 
randomized, and identities were blinded from the experimenter 
until the final census. All surviving animals were humanely eutha- 
nized 120 h after injection. Experiments were performed indepen- 
dently twice. 


ELISA 

Circulating cytokine levels from mouse serum collected 6h after admin- 
istration of endotoxin were measured with Mouse Quantikine ELISA 
kits (TNF, MTA00B; IL6, M6000B) according to the manufacturer’s
instructions. Chemiluminescence was measured at a wavelength of 
450 nm using a Synergy HT plate reader (BioTek) and cytokine concen- 
trations were quantified using a four-parameter fit curve. 
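
To illustrate the four-parameter fit mentioned above, the following is a minimal Python sketch of a four-parameter logistic (4PL) curve and its inverse, which converts a measured signal back to a concentration. The function names, the parameter names (bottom, top, ec50, hill) and the example values are assumptions made for illustration; they are not taken from this study, and fitting the parameters to the kit standards is assumed to be done separately (for example by the plate-reader software).

```python
def four_pl(conc, bottom, top, ec50, hill):
    """Signal predicted by a 4PL curve at concentration `conc`.

    bottom/top are the lower/upper asymptotes, ec50 is the concentration
    giving the half-maximal signal, and hill is the slope factor.
    """
    if conc <= 0:
        return bottom  # the curve approaches the lower asymptote at zero
    return bottom + (top - bottom) / (1.0 + (ec50 / conc) ** hill)


def conc_from_signal(signal, bottom, top, ec50, hill):
    """Invert the 4PL curve: concentration that would give `signal`."""
    if not bottom < signal < top:
        raise ValueError("signal must lie strictly between the asymptotes")
    ratio = (top - bottom) / (signal - bottom) - 1.0
    return ec50 / ratio ** (1.0 / hill)


# Hypothetical fitted parameters (illustrative values only).
params = dict(bottom=0.05, top=2.5, ec50=200.0, hill=1.2)
reading = four_pl(120.0, **params)
estimate = conc_from_signal(reading, **params)  # recovers ~120.0
```

Readings outside the asymptotes cannot be interpolated, which is why the inverse raises an error rather than extrapolating.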


In vivo transcription profiling 

Eight-week-old wild-type C57BL/6, MHD3KO and NSDAD mice and 
their control littermates were intraperitoneally injected with either 
phosphate-buffered saline (PBS, Sigma) or 10 mg kg⁻¹ of purified LPS
(Sigma E. coli O111:B4) at 16:00. Without thioglycolate stimulation,
peritoneal macrophages were collected 6 h after LPS or PBS injection
at 22:00 and passed through pre-separation filters (Miltenyi Biotec, 
130-041-407) to obtain a single-cell suspension. Peritoneal mac- 
rophages were then isolated by magnetic-activated cell sorting as 
previously described”, following the manufacturer’s instructions 
(Miltenyi Biotec, 130-110-434) and using MiniMACS columns (Miltenyi 
Biotec, 130-042-201). RNA was isolated from purified cells using TRIzol 
(Invitrogen) with the RNeasy Micro kit (Qiagen). 


Cell culture 

BMDM were collected and cultured as previously described’. In brief, 
DMEM supplemented with 30% L929 (Sigma, 8501) conditioned
medium, 10% fetal bovine serum (Tissue Culture Biologics), 100 U ml⁻¹
penicillin and 100 µg ml⁻¹ streptomycin (Invitrogen) was used to culture
and differentiate cells for 7 days in Petri plates, with fresh medium
replaced on the fourth day and replating on the sixth day after collection.
Cells were then treated with PBS (vehicle), 10 ng ml⁻¹ IL-4 (Peprotech)
for 24 h, or 5 ng ml⁻¹ LPS (Sigma) for 4 h. For HDAC inhibitor
studies, a 4-h pre-treatment with either vehicle control (10% DMSO) or
25-400 nM SAHA (Peprotech) before the addition of vehicle or LPS
was performed. All cell cultures were maintained at 37 °C in 5% CO₂
incubators. No cell line authentication or mycoplasma contamination 
testing was performed. 


Retroviral expression of exogenous HDAC3 mutant

Sequencing-confirmed wild-type HDAC3 (WT-rescue) and
HDAC3(Y298F) cDNAs were cloned into a murine stem cell virus 
(MSCV) retroviral construct. Retroviruses were produced in HEK-293FT 
(Invitrogen, R70007) cells by co-transfection of MSCV containing either 
no insert (empty vector), HDAC3(Y298F) or WT-rescue and pCL-Eco 
vectors with FUGENE 6 (Promega), and supernatants were collected 
72 h after transfection. Supernatants from three 15-cm tissue culture
dishes (1.0 × 10⁷ cells per dish) were passed through a 0.20-µm filter
(Corning) before viral concentration with PEG-it Virus Precipitation
Solution (System Biosciences) at 4 °C to a final stock volume of 600 µl.
Concentrated retrovirus was added to 1.0 × 10⁶ differentiating macrophages
on day 4 (of the 7-day period) in DMEM containing 2 µg ml⁻¹
polybrene. Fresh medium containing 1 µg ml⁻¹ puromycin was added
48 h later for selection of successful viral transduction, which occurred
over the next 4 days. We empirically determined that using 300 µl (titration
was performed at 50, 100, 200, 300, 400 and 500 µl) of the concentrated
viral stock allows re-expression of exogenous HDAC3 and the
HDAC3(Y298F) mutant to a level comparable to that of endogenous
HDAC3. To control for viral transduction, control and MHD3KO mac- 
rophages were also transduced with retroviruses carrying the empty 
vector. 


DsiRNA-mediated gene silencing 

After the 7-day period of macrophage differentiation as described 
in ‘Cell culture’, 5.0 × 10⁶ cells were plated overnight in 10-cm tissue
culture dishes. The medium was replaced with 10 ml of serum-free,
antibiotic-free DMEM for maximum transfection efficiency. For each 
target gene, 3 different siRNAs (dicer-substrate RNA (Dsi-RNA), IDT) 
were used as independent biological replicates. Transfection was done 
with Lipofectamine RNAiMax and siRNAs, or transfection control per 
manufacturer’s protocol, with total RNAi concentration held constant 
at 50 nM. At 48 h post-transfection, transfected BMDM were collected. 
Knockdown of specific genes was confirmed by qPCR and western blot 
before downstream experiments. Gene-specific siRNA sequences can 
be found in Supplementary Table 1. 


Dual luciferase assays 

For unbiased selection of enhancer elements for reporter genes, first, 
consensus sequences for 200-bp DNA-sequences of enhancer ele- 
ments upstream of DA-independent genes (171) and DA-dependent 
genes (142) were obtained using Clustal Omega multiple-alignment 
tool. The 200-bp enhancer element 14-kb upstream of Clec2d was 
selected for ATF2 reporter gene assay because it mapped best to the 
consensus sequence of 171 enhancer elements (multi-mapping score 
151/200 bp, allowing gaps) for DA-independent genes. Similarly, the 
200-bp enhancer element 16-kb upstream of Gas6 was computationally 
selected for ATF3 reporter gene assay (multi-mapping score 177/200 bp, 
allowing gaps) because it mapped best to the consensus sequence of 142 
enhancer elements for DA-dependent genes. A mutant reporter gene 
was generated by mutating the intrinsic ATF2 motif to an ATF3 motif for 
the ATF2 reporter, and vice versa for the ATF3 reporter. Unique flanking 
5′ and 3′ extensions allowing for restriction enzyme digestion were added
as PCR anchor points. DNA cassettes were purchased as Ultramers from
IDT, PCR-amplified, and subcloned into the pGL4.24 vector (Promega)
with a minimal promoter using the restriction enzymes KpnI and XhoI
(NEB). Full-length sequences for each multimerized motif are available
in Supplementary Table 1. For luciferase assays, BMDM were seeded at
1.0 × 10⁴ cells per well in a 96-well plate and co-transfected with 10 ng of
luciferase reporter construct and 4 ng of CMV-Renilla plasmid, using 0.3 µl
of Lipofectamine 2000 per well, in a reverse transfection protocol.
Cells were lysed in passive lysis buffer (Promega Dual Luciferase Kit) 48 
h post-transfection. Luminescence was read using a Synergy HT plate 
reader (Biotek). Four biological replicates were used in each condition. 
Firefly luciferase signal was normalized to Renilla luciferase signal to
account for transfection efficiency. 
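
The normalization described above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' analysis script: the function names and the readings are hypothetical, and expressing the result as a fold change over a control condition is an added assumption.

```python
from statistics import mean


def normalized_luciferase(firefly, renilla):
    """Per-well firefly/Renilla ratio, correcting for transfection efficiency."""
    if len(firefly) != len(renilla):
        raise ValueError("firefly and Renilla readings must be paired per well")
    return [f / r for f, r in zip(firefly, renilla)]


def fold_over_control(treated_ratios, control_ratios):
    """Mean normalized signal of treated wells relative to control wells."""
    return mean(treated_ratios) / mean(control_ratios)


# Hypothetical luminescence readings from four wells per condition.
vehicle = normalized_luciferase([1200.0, 1100.0, 1300.0, 1250.0],
                                [400.0, 380.0, 420.0, 410.0])
lps = normalized_luciferase([5200.0, 4900.0, 5600.0, 5100.0],
                            [410.0, 390.0, 430.0, 400.0])
induction = fold_over_control(lps, vehicle)
```

Dividing within each well before averaging keeps a poorly transfected well from dominating the mean.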


Gene expression measurement 

RNA was isolated from cells using TRIzol (Invitrogen) and purified with 
the RNeasy minikit (Qiagen). qPCR was performed using 500 ng to 1 µg
of RNA (Applied Biosystems) following the manufacturer’s instruc- 
tions. qPCR was performed on cultured macrophages using the primers 
listed in Supplementary Table 1, with SYBR Green Master Mix (Applied 
Biosystems) on the QuantStudio 6 Flex Real-Time PCR system (Applied 
Biosystems). qPCR analysis was performed using the standard curve
method, and all genes were normalized to the housekeeping gene 36b4
(also known as Rplp0). All qPCR experiments were performed with at
least 3 biological replicates per condition. For RNA-seq, RNA integrity 
was examined using Agilent RNA 6000 Nano kit. RNA samples with 
RNA integrity number >7 (1 µg) were used for RNA cleanup and library
preparation with Illumina Truseq Stranded Total RNA Library Prep kit 
according to the manufacturer’s instructions. All barcoded libraries 
were quantified by KAPA Library Quantification Kit (Roche), and equi- 
molarly pooled for subsequent sequencing. All RNA-seq experiments 
were performed with at least three biological replicates per condition. 
For each biological replicate, macrophages were pooled from at least 
three or four different mice. 
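
As an illustration of the standard curve method of qPCR quantification with normalization to a housekeeping gene, the following minimal Python sketch fits Ct against log10 of input quantity for a dilution series, then converts sample Ct values to relative quantities. The function names and all numbers are hypothetical, and a simple least-squares line is assumed for the curve fit.

```python
import math


def fit_standard_curve(quantities, cts):
    """Least-squares fit of Ct = slope * log10(quantity) + intercept."""
    xs = [math.log10(q) for q in quantities]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(cts) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, cts))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def quantity_from_ct(ct, slope, intercept):
    """Read a sample's relative quantity off the fitted standard curve."""
    return 10 ** ((ct - intercept) / slope)


def relative_expression(target_ct, reference_ct, target_curve, reference_curve):
    """Target quantity divided by housekeeping-gene quantity (e.g. 36b4)."""
    target = quantity_from_ct(target_ct, *target_curve)
    reference = quantity_from_ct(reference_ct, *reference_curve)
    return target / reference


# Hypothetical tenfold dilution series (arbitrary units) and measured Ct values.
dilutions = [1000.0, 100.0, 10.0, 1.0]
curve = fit_standard_curve(dilutions, [20.0, 23.32, 26.64, 29.96])
```

A slope near -3.32 corresponds to roughly 100% amplification efficiency, which is a common sanity check on the dilution series.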


RNA-seq data processing 

All RNA-seq libraries were sequenced on Illumina HiSeq2000, 
HiSeq4000, NextSeq500 or NovaSeq6000 platforms. Sequencing reads
were aligned to the UCSC mm10 genome using STAR (v.2.6.0a)°* with the
following parameters: --outFilterMultimapNmax 1 --outSAMtype BAM
SortedByCoordinate --readFilesCommand zcat --outSAMstrandField
intronMotif. Read counts were then obtained from mapped bam files
with featureCounts (v.1.5.3)” with the following parameters: -t exon -g
gene_id -s 1 -T 20. To filter out lowly expressed genes, raw read counts
were transformed into transcripts per kilobase million (TPM) and only 
genes with TPM > 0.1 in at least two of the biological replicates were 
kept in the count matrix for further downstream differential expression
analyses. edgeR* estimateDisp() and calcNormFactors() functions were
used to estimate intrinsic dispersion and calculate normalization
factors, respectively, to account for variations in sequencing depth.
Normalized read counts were fit to a negative binomial generalized 
log-linear model with glmFit(). To account for changes in two variables 
simultaneously—for example, LPS and HDAC3—multi-variable design 
matrices were used. Differentially expressed (DE) genes (cut-off defined 
as |log₂FC| > 1, two-sided Benjamini-Hochberg-adjusted FDR < 0.05)
were identified using the edgeR likelihood ratio test glmLRT().
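
The expression filter described above (TPM > 0.1 in at least two biological replicates) can be sketched as follows. This is a simplified Python illustration rather than the authors' pipeline: gene lengths and counts are hypothetical, every gene is assumed to be present in every sample, and the replicate threshold is applied across whichever samples are passed in.

```python
def tpm(counts, lengths_bp):
    """Convert one sample's raw counts to transcripts per million (TPM).

    counts: gene -> raw read count; lengths_bp: gene -> gene length in bp.
    Assumes the sample has at least one non-zero count.
    """
    rates = {g: counts[g] / (lengths_bp[g] / 1000.0) for g in counts}
    total = sum(rates.values())
    return {g: rate / total * 1e6 for g, rate in rates.items()}


def expressed_genes(samples, lengths_bp, min_tpm=0.1, min_replicates=2):
    """Genes whose TPM exceeds min_tpm in at least min_replicates samples."""
    tpms = [tpm(sample, lengths_bp) for sample in samples]
    return sorted(
        g for g in lengths_bp
        if sum(t[g] > min_tpm for t in tpms) >= min_replicates
    )


# Hypothetical toy data: three replicates, three genes.
lengths = {"A": 1000, "B": 2000, "C": 1000}
replicates = [
    {"A": 100, "B": 200, "C": 0},
    {"A": 50, "B": 100, "C": 0},
    {"A": 10, "B": 0, "C": 0},
]
kept = expressed_genes(replicates, lengths)  # ['A', 'B']
```

Only the raw counts of the retained genes would then be passed on to the differential expression model, since edgeR works on counts rather than TPM.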

To identify the DA-independent transcriptome, first, LPS-stimulated 
differentially expressed genes (|log₂FC| > 1, FDR < 0.05) were identified
from (a) the difference between HDAC3(Y298F) versus MHD3KO:
[(Y298F LPS − Y298F vehicle) − (MHD3KO LPS − MHD3KO vehicle)];
(b) the difference between control versus MHD3KO: [(control LPS −
control vehicle) − (MHD3KO LPS − MHD3KO vehicle)]; (c) the difference
between WT-rescue versus MHD3KO: [(WT-rescue LPS − WT-rescue
vehicle) − (MHD3KO LPS − MHD3KO vehicle)]. DA-independent genes
were classified as those that fit the null hypothesis of equality (that is,
no difference, FDR > 0.05) for the HDAC3(Y298F)-rescue contrasts of
[(Y298F versus MHD3KO) − (control versus MHD3KO)] and [(Y298F
versus MHD3KO) − (WT-rescue versus MHD3KO)]. Similarly, the
DA-dependent transcriptome fit the null hypothesis of equality for
the contrasts of [(Y298F versus control) − (MHD3KO versus control)]
and [(Y298F versus WT-res) − (MHD3KO versus WT-res)]. As such, for
DA-independent genes, the HDAC3(Y298F) transcriptome was statis- 
tically modelled to be overlapping with both control and WT-rescue, 
while maximally distant to MHD3KO. Meanwhile, for DA-dependent 
genes, the HDAC3(Y298F) transcriptome was statistically modelled 
to be overlapping with MHD3KO, while maximally distant to both 


control and WT-rescue. Cumulative distribution function (CDF) plots 
for DA-independent and DA-dependent transcriptomes were generated
with the R package latticeExtra function ecdfplot(). Heat maps were 
generated with the R package pheatmap using z-transformed TPM or 
log₂(FC) values across all biological replicates and treatment conditions.
Gene ontology analyses of DA-independent, LPS-upregulated
and DA-dependent, LPS-downregulated genes were performed using 
Enrichr*"” reporting terms with the most significant reported q values
(<0.05). Gene set enrichment analyses were performed with GSEA
software**“* using gene sets provided directly from
https://software.broadinstitute.org/gsea/.
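
The difference-of-differences contrasts used to define the DA-independent and DA-dependent transcriptomes can be illustrated with a short Python sketch. This is a deliberately simplified stand-in for the edgeR model: it works on mean log₂ expression values for a single gene, omits the WT-rescue comparison, and takes the FDR values as given inputs. All names, thresholds passed as defaults, and numbers are hypothetical.

```python
def interaction(log2_expr, genotype_a, genotype_b):
    """(a LPS - a vehicle) - (b LPS - b vehicle) on the log2 scale.

    log2_expr maps (genotype, treatment) -> mean log2 expression of one gene.
    """
    response_a = log2_expr[(genotype_a, "LPS")] - log2_expr[(genotype_a, "vehicle")]
    response_b = log2_expr[(genotype_b, "LPS")] - log2_expr[(genotype_b, "vehicle")]
    return response_a - response_b


def classify(control_vs_ko_log2fc, control_vs_ko_fdr,
             fdr_y298f_vs_control, fdr_y298f_vs_ko, alpha=0.05):
    """Simplified classification of one gene (assumed rule, see lead-in)."""
    hdac3_dependent = abs(control_vs_ko_log2fc) > 1 and control_vs_ko_fdr < alpha
    if not hdac3_dependent:
        return "not HDAC3-dependent"
    same_as_control = fdr_y298f_vs_control > alpha  # equality not rejected
    same_as_ko = fdr_y298f_vs_ko > alpha
    if same_as_control and not same_as_ko:
        return "DA-independent"  # the catalytically dead mutant rescues
    if same_as_ko and not same_as_control:
        return "DA-dependent"    # the mutant behaves like the knockout
    return "unclassified"


# Hypothetical mean log2 expression for one LPS-induced gene.
expr = {
    ("Y298F", "LPS"): 8.0, ("Y298F", "vehicle"): 5.0,
    ("MHD3KO", "LPS"): 5.5, ("MHD3KO", "vehicle"): 5.0,
}
delta = interaction(expr, "Y298F", "MHD3KO")  # 3.0 - 0.5 = 2.5
```

In the actual analysis these contrasts are coefficients of a multi-variable design matrix tested with a likelihood ratio test, so the sketch only conveys the arithmetic of the comparison.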


GRO-seq and data processing 

GRO-seq was performed as previously described with the following
modifications: BMDM were collected from control and MHD3KO using 
three independent biological replicates per condition. For each bio- 
logical replicate, BMDM from 10 different mice were pooled together 
before being treated with either PBS control or 5 ng ml⁻¹ of LPS for 4 h
before collection. Cells were washed twice with ice-cold PBS, then put
in cold swelling buffer (10 mM Tris at pH 7.5, 2 mM MgCl₂, 3 mM CaCl₂)
for 5 min on ice. Cells were scraped off and centrifuged at 400g for 10
min. The nuclei were then extracted using lysis buffer (swelling buffer
with 10% glycerol and 1% Igepal). After two washes with lysis buffer,
nuclei were resuspended in freezing buffer (50 mM Tris pH 8.3, 40%
glycerol, 5 mM MgCl₂, 0.1 mM EDTA). Nuclei (1.0 × 10⁷) were used for
each library. The nuclear run-on reaction was incubated in run-on buffer
(10 mM Tris pH 8.0, 5 mM MgCl₂, 1 mM DTT, 300 mM KCl, 200 U ml⁻¹
Superase-In, 1% Sarkosyl, 500 mM ATP, GTP and Br-UTP, 2 mM CTP) 
for 7 min at 30 °C. Nascent transcripts were enriched with anti-BrUTP 
antibodies. After 10 min of RNA hydrolysis and reverse transcription, 
all products were used for GRO-seq library preparations in parallel to 
reduce batch effect. 

GRO-seq libraries were barcoded and sequenced on the Illumina 
HiSeq4000 platform. Bidirectional eRNA identification was performed 
as previously described**’”. HDAC3-colocalized eRNAs were identified 
by overlapping all HDAC3 genomic sites with all bidirectional eRNA sites 
using bedtools*’ intersectBed. Bidirectional eRNA signal densities at 
specific HDAC3 peak regions were obtained with annotatePeaks.pl with
a 2-kb window from the centre of the HDAC3 peak, using the option -size
2000 -hist with a 25-bp bin size. Average density profiles were gener- 
ated with mean signal in normalized reads per million (RPM) and error 
bands for biological replicates. Density profile statistics were calculated 
with a Wilcoxon rank sum test using the R function wilcox.test(), for 
two independent samples with no assumption of normal distribution 
and a null hypothesis of equal medians (x − y = μ = 0).
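
For readers unfamiliar with the rank sum test used for the density profile statistics, the following is a minimal pure-Python sketch of a two-sided Wilcoxon rank sum test using the normal approximation with tie and continuity corrections. It is an illustration of the technique, not a reimplementation of R's wilcox.test(), which may use an exact calculation for small samples without ties; the function name and the example data are hypothetical.

```python
import math


def rank_sum_test(x, y):
    """Two-sided Wilcoxon rank sum test (normal approximation).

    Returns (W, p), where W is the rank sum of x minus its minimum
    possible value, n1 * (n1 + 1) / 2.
    """
    n1, n2 = len(x), len(y)
    n = n1 + n2
    pooled = sorted((value, index) for index, value in enumerate(list(x) + list(y)))
    ranks = [0.0] * n
    tie_term = 0.0
    start = 0
    while start < n:
        end = start
        while end + 1 < n and pooled[end + 1][0] == pooled[start][0]:
            end += 1
        midrank = (start + end) / 2.0 + 1.0  # average rank of the tied block
        for k in range(start, end + 1):
            ranks[pooled[k][1]] = midrank
        size = end - start + 1
        tie_term += size ** 3 - size
        start = end + 1
    w = sum(ranks[:n1]) - n1 * (n1 + 1) / 2.0
    mean_w = n1 * n2 / 2.0
    var_w = n1 * n2 / 12.0 * ((n + 1) - tie_term / (n * (n - 1)))
    if var_w <= 0:
        return w, 1.0  # all observations identical
    diff = w - mean_w
    correction = 0.5 if diff > 0 else (-0.5 if diff < 0 else 0.0)
    z = (diff - correction) / math.sqrt(var_w)
    p = math.erfc(abs(z) / math.sqrt(2.0))  # two-sided tail probability
    return w, min(p, 1.0)


# Hypothetical binned signal values for two conditions.
w_stat, p_value = rank_sum_test([1, 2, 3, 4, 5], [6, 7, 8, 9, 10])
```

Because the test uses only ranks, it makes no assumption that the binned signal is normally distributed, which is the property noted in the text.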
Western blot

Cellular lysates were prepared by lysing cells (0.5-1.0 × 10⁷) in ice-cold
RIPA buffer containing protease and phosphatase inhibitors (Roche). 
For the H3K27Ac immunoblot, HDAC3 inhibitor cocktail (sodium 
butyrate, nicotinamide and trichostatin A) was also added to the lysis 
buffer. Cellular debris was removed by centrifugation (21,000g) for 
20 min at 4 °C. Protein concentration was measured with Pierce BCA 
Protein Assay Kit (Fisher, 23225) and all samples were normalized to the
same input protein concentration. Samples were denatured in loading 
buffer, separated using SDS-PAGE, transferred to a polyvinylidene dif- 
luoride membrane (Pierce), blocked with Odyssey TBS Blocking Buffer 
(LICOR 927-50100), and probed using target-specific primary antibod- 
ies (listed in Supplementary Table 1) at 1:1,000 dilution. Bound primary 
antibodies were detected by species-specific LICOR fluorescent IRDye 
secondary antibodies (IRDye 800CW goat anti-mouse 925-32210, lot 
C70712-11, 1:20,000 dilution; IRDye 680RD donkey anti-rabbit 926- 
68073, lot C90129-15, 1:20,000 dilution). Immunoblots were imaged 
with the Odyssey Sa Infrared Imaging System according to the manu- 
facturer’s instructions. Dual-channel colour (red/green) blots were 


converted to greyscale for colour-blind-friendly visualization pur- 
poses. Blots were cut appropriately by molecular weight whenever 
cross-species antibodies were used to reduce background (resulting
from using mouse-derived antibodies in mouse-derived tissues). Raw
blots can be found in Supplementary Fig. 1. 


Immunoprecipitation 

Fully differentiated BMDM or peritoneal macrophages were lysed 
with ChIP buffer (50 mM Tris-HCl pH 7.5, 500 mM NaCl, 1 mM EDTA, 1%
Triton-X 100, 0.1% sodium deoxycholate, 0.1% SDS, supplemented with
cOmplete Protease Inhibitor (Roche); for H3K27Ac, HDAC3 inhibitor
cocktail with sodium butyrate, nicotinamide and trichostatin A was
also added), incubated on ice for 30 min, and centrifuged at maximum
speed for 30 min at 4 °C to pellet cell debris. Of the cleared lysates, 5%
were saved as input. Lysates were subjected to BCA protein quantification
to normalize the amount of input material subjected to immunoprecipitation.
Lysates were then incubated with antibody overnight
at 4 °C, followed by immunoprecipitation with Protein A Sepharose
CL-4B beads (30 µl wet beads per IP) for 6 h at 4 °C. Beads were subsequently
subjected to a 5-min wash with ChIP buffer, a 5-min wash with
ChIP buffer with additional 500 mM NaCl, and a 5-min wash with wash
buffer (10 mM Tris-HCl pH 8.0, 250 mM LiCl, 0.5% NP-40, 10% sodium
deoxycholate, 1 mM EDTA).


HDAC activity assay 

The deacetylase activity of immunoprecipitated HDAC3 (as described in
‘Immunoprecipitation’) from control, MHD3KO, HDAC3(Y298F) or WT-rescue
BMDM was measured using the HDAC Assay Kit (Active Motif 56200) according
to the manufacturer’s instructions, with 1 mg of starting material and
anti-HDAC3 antibody (Abcam 7030). Substrate solution containing
short peptides with acetylated lysine residues was added directly
to the HDAC3-antibody-Protein A bead slurry complex. After incu- 
bation with developing solution, fluorescent signals from standards 
and samples were detected using a plate reader system (Biotek Synergy 
HT), and deacetylase activity was determined by the standard curve 
method. 


ChIP-seq and data processing

All reported ChIP-seq experiments were performed in three independent
biological replicates, except for HDAC3 ChIP-seq in ATF2-, ATF3- or
p65-depleted cells, for which there were two independent biological 
replicates per condition. For each biological replicate, BMDM were 
pooled from at least five different mice for sufficient input material. 
BMDM (1.0 × 10⁷) were crosslinked on a Petri plate with 1% formaldehyde
for 15 min at room temperature, then quenched with 2.5 M glycine 
for 5 min. Crosslinked cells were then scraped off the plates in PBS, 
collected by centrifugation (1,000g for 5 min) and lysed in 1 ml ChIP 
buffer (as described in ‘Immunoprecipitation’) and placed on ice for 
10 min. Chromatin fragmentation was performed with a microtip probe
sonicator (Fisher Scientific, FB705 Sonic Dismembrator) at 4 °C with
the following settings: 3× 10% amplitude for 10 s, followed by 3× 15%
amplitude for 15 s, with a 30-s pause in between each sonication. Chromatin
was then cleared with centrifugation at maximum speed for
30 min at 4 °C, 5% saved for input, and incubated with antibodies (listed 
in Supplementary Table 1) overnight at 4 °C. Immunoprecipitation of 
ChIP DNA was similar to the immunoprecipitation procedure described 
in the above section, except for reversal of crosslinking overnight at 
65 °C with elution buffer (50 mM Tris-HCl pH 8.0, 10 mM EDTA, 1%
SDS) before treatment with 0.33 mg ml⁻¹ RNase A (Fisher EN0531) and
0.5 mg ml⁻¹ Proteinase K (Fisher 2546) at 37 °C for 1 h. DNA was isolated
with phenol/chloroform followed by overnight ethanol precipitation 
at —20 °C. ChIP DNA was prepared for sequencing according to the 
amplification protocol provided by Illumina. 

All ChIP-seq libraries were barcoded and sequenced on NextSeq500 
or HiSeq4000. Sequencing reads were aligned to the UCSC mm10
genome using Bowtie2 (v.2.2.6)*? with the following parameters:
-p 20 -N 1. Mapped reads were processed into tag counts, filtered for
PCR duplicates, and corrected for read-depth bias and fragment length
for each biological replicate with the HOMER” function makeTagDirectory
-tbp 1 -fragLength 150 -totalReads 2e7. For peak calling, the findPeaks
function was used with parameter -size 200 to resize all peaks to a uni- 
form size of 200 bp. As background, input samples were used, except 
for HDAC3 for which MHD3KO was used. Peaks were called indepen- 
dently for each replicate. Only peaks that are present in at least two 
biological replicates are considered ‘true’ peaks. Peaks are defined as 
fold change IP/input (KO) > 4, FDR < 0.0001. Genome browser tracks
were generated with HOMER function makeUCSCfile -bigWig -fsize 
1e20 and corrected for variations in read-depth with -norm 2e7. For 
identification of HDAC3 peaks near DA-independent and DA-dependent 
genes, bedtools window -I1e5 was used on genomic location of respec- 
tive transcription start sites (obtained from https://genome.ucsc.edu) 
and HDAC3 peaks. For each gene, only the closest peak was counted. For
peaks that happened to be close to both a DA-independent gene and a
DA-dependent gene, whichever one was closer to the peak was assigned. 
ChIP-seq signal densities at specific peak regions were obtained with 
annotatePeaks.pl with a 2-kb window from the peak centre, using the
option -size 2000 -hist (for histograms) or -ghist (for heat maps) with 
25-bp bin size. Average density profiles were generated with mean signal 
in normalized reads per million (RPM) and error bands for biological 
replicates. Density profile statistics were calculated with a Wilcoxon 
rank sum test using the R function wilcox.test(), for two independent
samples with no assumption of normal distribution and a null
hypothesis of equal medians (x − y = 0). De novo motif analyses
were performed with the HOMER function findMotifsGenome.pl. For 
identification of motifs specific to DA-independent and DA-dependent 
sites, the machine-learning-based tool TBA was applied to HDAC3-bound
genomic regions near DA-independent genes and DA-dependent 
genes. TBA (v.1.0) was operated under recommended parameters 
(https://github.com/jenhantao/tba) with multi-threading and option 
-p to test for significance with a likelihood ratio test. For differential 
motif enrichment, DA-independent and DA-dependent genomic sites 
were cross-compared against each other as input and background, 
and vice versa. An enrichment score heat map was generated using
−log₁₀(hypergeometric P value) for enrichment and log₁₀(hypergeometric
P value) for depletion. ChIP-seq quality controls, including the number
of mapped reads, peak counts and FRiP score, are provided in Supplementary
Table 2. Statistically significant motifs from HOMER de novo
motif discovery for HDAC3, ATF2, ATF3, p65, NCoR1 and NCoR2 are
listed in Supplementary Table 3.
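
The signed enrichment score used for the motif heat map can be illustrated with a minimal sketch. It is not the HOMER or TBA implementation: the function names, the argument names and the rule for choosing the sign (comparing the observed overlap with its expectation) are assumptions made for illustration only.

```python
import math


def hypergeom_tail(k, population, successes, draws, upper=True):
    """Return P(X >= k) if upper, else P(X <= k), for a hypergeometric X."""
    lo = max(0, draws - (population - successes))  # smallest possible overlap
    hi = min(successes, draws)                     # largest possible overlap
    counts = range(max(k, lo), hi + 1) if upper else range(lo, min(k, hi) + 1)
    total = math.comb(population, draws)
    favourable = sum(
        math.comb(successes, i) * math.comb(population - successes, draws - i)
        for i in counts
    )
    return favourable / total


def signed_enrichment_score(k, population, successes, draws):
    """-log10(P) when the motif is enriched, log10(P) when it is depleted.

    Assumption: 'enriched' means the observed count k is at least the
    expected count under random sampling.
    """
    expected = draws * successes / population
    if k >= expected:
        return -math.log10(hypergeom_tail(k, population, successes, draws, upper=True))
    return math.log10(hypergeom_tail(k, population, successes, draws, upper=False))
```

With this convention, enriched motifs receive positive scores and depleted motifs negative scores, so a single diverging colour scale can show both directions.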


Statistics and reproducibility 

All data were presented with individual biological replicates shown 
when appropriate. RStudio (v.1.0.153) with the R packages edgeR
(v.3.26.8), ggplot2 (v.3.3.0), latticeExtra (v.0.6-29) and pheatmap (v.1.0.12),
and GraphPad Prism (v.8.0) were used for graphing and statistical
analysis. Genome-browser tracks were visualized with IGV (v.2.3.92).
All statistical tests are fully described in the figure legends. In general,
P values were calculated using two-tailed Student’s t-test for pairwise
comparisons and one-way ANOVA for multiple comparisons. All statistical
tests were adjusted using the Holm-Sidak method for multiple testing,
with a 95% confidence interval. Hypergeometric testing was either
reported by HOMER (v.4.11) or Enrichr (v.2.1), or was performed
with the R function phyper(). Statistics reported by Enrichr (q values)
were adjusted using the Benjamini-Hochberg method. Correlations
with associated P values from two-sided t-tests were calculated using 
the R function cor(). 
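
As an illustration of the Benjamini-Hochberg adjustment mentioned above, the following is a minimal pure-Python sketch of the standard step-up procedure. It is not the code used by Enrichr or in this study; the function name is hypothetical.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted P values (q values) in input order."""
    m = len(p_values)
    # Indices of the P values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted
```

Each raw P value is scaled by m/rank, and the running minimum keeps the adjusted values non-decreasing in rank and capped at 1.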

The following experiments were performed independently twice 
with similar results: Fig. 1a, b (HDAC3 gene and protein expression), 
Extended Data Fig. 2e (H3K27Ac western blot) and Extended Data Fig. 6a 
(in vivo endotoxin susceptibility in SAHA-treated mice). 

The following experiments were performed once: Fig. 1c (HDAC3
catalytic activity), Fig. 1d–f and Extended Data Fig. 1a–h (RNA-seq),
Figs. 2a, b, 3a, b and Extended Data Figs. 2a, b, h, 3a, b (HDAC3 ChIP-seq),
Fig. 2c, e and Extended Data Figs. 2c, f, g, 3d (GRO-seq), Fig. 2d, f and
Extended Data Fig. 2a, d (H3K27Ac ChIP-seq), Fig. 3c, d and Extended
Data Fig. 4a, m, n (ATF2 and ATF3 ChIP-seq), Fig. 3e–h and Extended
Data Fig. 4d–f (HDAC3 ChIP-seq and RNA-seq in siAtf2/siAtf3/sip65
macrophages), Fig. 3i, j and Extended Data Fig. 4i–l (NCoR1/2 ChIP-seq),
Extended Data Fig. 3e–h (luciferase assays), Extended Data Fig. 4b, c
(ATF2/ATF3/p65 expression), Extended Data Fig. 4g, h (p65 ChIP-seq),
Fig. 4a (in vivo endotoxin susceptibility in genetic mouse models),
Fig. 4b, c and Extended Data Fig. 5c–j (in vivo RNA-seq and HDAC3
ChIP-seq), Fig. 4d (SAHA endotoxin susceptibility in genetic model),
Extended Data Fig. 5a, b (endotoxin susceptibility and cytokine measurement),
Extended Data Fig. 6b–f (HDAC3 ChIP-seq and RNA-seq in
SAHA-treated macrophages).


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


All sequencing data from RNA-seq, ChIP-seq and GRO-seq analyses 
have been deposited to the Gene Expression Omnibus under accession 
number GSE140611. ChIP-seq data for Fos, JunB, JunD and CREB in 
LPS-stimulated BMDMs were obtained from GSE99895. All other data 
are available from the corresponding author upon reasonable request. 

Extended Data Fig. 1 | DA-independent and DA-dependent functions of
HDAC3 in the inflammatory response to LPS. a, Scatter plots of RNA-seq
experiments in control, MHD3KO, HDAC3(Y298F) and WT-rescue BMDM, with
or without LPS, showing correlation between biological replicates using either
log₂ transformation (log₂(x + 1)), variance stabilizing transformation (vst), or
regularized-logarithm transformation (rlog) of normalized read counts.
b, Principal component analysis (PCA) plot using the vst values from RNA-seq
experiments in control, MHD3KO, HDAC3(Y298F) and WT-rescue BMDM, with
or without LPS (n = 4 biological replicates). c, Heat map of sample-to-sample
distances using the vst values from RNA-seq experiments in control, MHD3KO,
HDAC3(Y298F) and WT-rescue BMDM, with or without LPS (n = 4 biological
replicates). d, Empirical cumulative distribution function (CDF) and associated
two-sided Kolmogorov-Smirnov test D statistics of LPS-stimulated changes in
statistically modelled DA-independent gene expression (385 genes) for
control, MHD3KO, HDAC3(Y298F) and WT-rescue BMDM (n = 4 biological
replicates). e, Empirical CDF and associated two-sided Kolmogorov-Smirnov
test D statistics of LPS-stimulated changes in statistically modelled
DA-dependent gene expression (377 genes) for control, MHD3KO, HDAC3(Y298F)
and WT-rescue BMDM (n = 4 biological replicates). f, Heat map showing 142
LPS-downregulated genes that were rescued by wild-type HDAC3 but not by
HDAC3(Y298F) (DA-dependent genes: n = 4 biological replicates, DE cutoff:
|log₂FC| > 1, two-sided Benjamini-Hochberg-adjusted FDR < 0.05 as
determined by edgeR likelihood ratio test). g, Gene ontology analysis of 142
LPS-downregulated genes (n = 4 biological replicates). q values represent
Benjamini-Hochberg-adjusted one-sided hypergeometric P values for
over-representation as determined by Enrichr. h, LPS-stimulated (762 genes) and
IL4-stimulated (405 genes) transcriptomic correlation of MHD3KO and
HDAC3(Y298F), compared with control BMDM (n = 4 biological replicates).
Heat map showing Spearman’s correlation ρ values with corresponding
two-sided P values calculated by t-tests.

Extended Data Fig. 2 | Differential recruitment and enhancer activity of
HDAC3 at LPS-responsive genes. a, Scatter plots of ChIP-seq experiments for
HDAC3 and H3K27Ac in vehicle or LPS-treated control BMDM (n = 3 biological
replicates) showing correlation between biological replicates using tag counts
per base-pair (TPB) at identified peak regions. Pearson’s P values were
calculated with two-sided t-tests. b, Average density profiles in RPM of HDAC3
ChIP-seq showing mean ± s.e.m. (n = 3 biological replicates) at all identified
HDAC3 peaks (10,966 total) in control and MHD3KO macrophages, with or
without LPS. c, Average density profiles in RPM of GRO-seq showing
mean ± s.e.m. (n = 3 biological replicates) at all identified eRNAs (12,192 total) in
control and MHD3KO BMDM, with or without LPS. d, Average density profiles
in RPM of H3K27Ac showing mean ± s.e.m. (n = 3 biological replicates) at all
identified H3K27Ac peaks (50,247 total) in control and MHD3KO macrophages,
with or without LPS. e, Western blot (performed independently twice) of
HDAC3 and H3K27Ac protein levels in control and MHD3KO macrophages, with
vinculin and histone H3 as loading controls. f, g, Genome-browser tracks
showing three biologically replicated examples of enhancer and gene body
activity as measured by GRO-seq relative to HDAC3 ChIP-seq peaks in control
and MHD3KO macrophages, with or without LPS, near DA-independent (f) or
DA-dependent (g) genes. h, Top de novo enriched motifs at HDAC3-bound
genomic regions (10,966 sites from n = 3 biological replicates). Statistics were
determined by HOMER with one-sided hypergeometric P values for
over-representation.

Extended Data Fig. 3 | ATF2 and ATF3 differentially mediate HDAC3
transcriptional effects at DA-independent and DA-dependent sites,
respectively. a, Comparison of the performance of TBA classifiers modelled
against DA-independent and DA-dependent 200-bp DNA sequences as
measured by the area under the receiver operating characteristic curve
(auROC, n = 5 independent train-test iterations, data shown as mean ± s.d.).
b, Heat map showing relative enrichment of several AP-1 family member DNA
motifs as determined by TBA with likelihood ratio test against DA-independent
and DA-dependent 200-bp DNA sequences across 5 train-test iterations. The
non-redundant, merged motifs from TBA include ATF1, 4, 5, 6, 7, Jun family, Fos
family, and other related bZIP factors. c, Average density profiles in RPM of Fos,
JunB, JunD and CREB ChIP-seqs obtained from Gene Expression Omnibus
database at HDAC3-bound sites near either DA-independent or DA-dependent
genes in LPS-treated BMDM. d, Average density profiles in RPM of bidirectional
eRNA transcription measured by GRO-seq showing mean ± s.e.m. (n = 3
biological replicates) at HDAC3-bound enhancers with ATF2 motif (1,680 sites)
or ATF3 motif (3,673 sites) (two-sided Wilcoxon’s P=8.6 x 10°”) in LPS-treated 
BMDM. e, f, Genome-browser tracks showing HDAC3, ATF2 and ATF3 ChIP-seq
peaks at enhancer elements upstream of Clec2d (e) or Gas6 (f), as well as gene
body activity as measured by RNA-seq in control, MHD3KO, HDAC3(Y298F) and
WT-rescue macrophages, with or without LPS. g, h, Dual-luciferase assays of
transcription driven by the Clec2d (g) or Gas6 (h) enhancers in control,
MHD3KO, HDAC3(Y298F) rescue, WT-rescue (WT), ATF2-depleted and ATF3-depleted
BMDM, with or without LPS stimulation. Data shown are mean ± s.d.,
n = 4 biological replicates. P values were calculated by one-way ANOVA.
DA-independent sites = 172, DA-dependent sites = 141.

Extended Data Fig. 4 | ATF2 and ATF3 recruit HDAC3 to sites near
DA-independent and DA-dependent genes, respectively. a, Scatter plots of
ChIP-seq experiments for ATF2, ATF3, NCoR1, NCoR2 and p65 in LPS-treated
BMDM (n = 3 biological replicates) showing correlation between biological
replicates using average tag counts per base-pair (TPB) at identified peak
regions. Pearson’s P values were calculated with two-sided t-tests. b, Relative
gene expression levels of Atf2, Atf3 and p65 (also known as Rela) as measured
by qPCR in control (siC), ATF2-depleted (siAtf2), ATF3-depleted (siAtf3), or
p65-depleted (sip65) BMDM (n = 3 biological replicates). Data shown are
mean ± s.d., P values calculated by two-sided Student’s t-test. c, Western blots
(performed once) of ATF2, ATF3 and p65 protein levels with vinculin as loading
control for control (siControl), ATF2-depleted (siAtf2), ATF3-depleted (siAtf3)
or p65-depleted (sip65) BMDM, each with 3 independent siRNAs. d, Heat map
of sample-to-sample distances using the vst values from RNA-seq experiments
in control (siC), ATF2-depleted (siAtf2), ATF3-depleted (siAtf3) or p65-depleted
(sip65) BMDM, with or without LPS (n = 3). e, f, Genome-browser tracks showing
biologically replicated examples of HDAC3 ChIP-seq peaks in control
(siControl), ATF2-depleted (siAtf2), ATF3-depleted (siAtf3) or p65-depleted
(sip65) LPS-stimulated BMDM near DA-independent (e) or DA-dependent (f)
genes. g, h, Average density profiles in RPM of p65 ChIP-seq showing
mean ± s.e.m. (n = 3 biological replicates) at HDAC3-bound sites near DA-
independent genes (Control LPS versus MHD3KO LPS two-sided Wilcoxon’s 
P=0.063) (g) or DA-dependent genes (Control LPS versus MHD3KO LPS two- 
sided Wilcoxon’s P= 2.4 x 10) in control and MHD3KO BMDM, with or without 
LPS (h). i, j, Box-and-whisker plot showing minimum, maximum, median, first
quartile and third quartile from quantification of average ChIP-seq signal
(n = 3 biological replicates) in RPM for input, HDAC3, NCoR1 and NCoR2 at
HDAC3-bound sites near DA-independent (i) or DA-dependent (j) genes.
P values were calculated with two-sided Mann-Whitney test. k, l, Average
density profiles in RPM showing mean ± s.e.m. (n = 3 biological replicates) of
NCoRI1 (two-sided Wilcoxon’s P= 4.7 x 10“) (k) and NCoR2 (two-sided 
Wilcoxon’s P=1.2 x 10) (I) genomic colocalization with either ATF2 (19,594 
peaks) or ATF3 (57,041 peaks). m, n, Genome-browser tracks showing three
biologically replicated examples of ChIP-seq peaks for HDAC3, ATF2, ATF3,
NCoR1 and NCoR2 in LPS-stimulated BMDM near DA-independent (m) or
DA-dependent (n) genes. DA-independent sites = 172, DA-dependent sites = 141.

Extended Data Fig. 5 | Loss of HDAC3 protein but not deacetylase activity
protects mice from acute endotoxic shock. a, Kaplan-Meier curves of
wild-type C57BL/6 mice injected with increasing doses of LPS and observed for
120 h (n = 4 independent mice). b, Serum cytokine concentrations measured by
ELISA for IL6 and TNF in 10 mg kg⁻¹ LPS-injected control, MHD3KO, NSDAD,
C57BL/6 mice (n = 5 independent mice) administered with increasing doses of
SAHA (25, 100, 400 mg kg⁻¹), and untreated control (n = 3 independent mice).
Data shown are mean ± s.d. P values calculated by one-way ANOVA. c, PCA plot
using the vst values from RNA-seq experiments of in vivo peritoneal
macrophages from control, MHD3KO and NSDAD mice injected with vehicle
control (PBS) or 10 mg kg⁻¹ LPS (n = 4 biological replicates, except for KO vehicle
and NSDAD LPS with n = 3 biological replicates). d, Scatter plot showing
correlation between in vitro (n = 4 biological replicates) and in vivo (n = 3
biological replicates) DA-independent differential gene expression (174 genes).
Pearson’s P value was calculated with two-sided t-test. e, Empirical CDF and
associated two-sided Kolmogorov-Smirnov test D statistics of LPS-stimulated
changes in statistically modelled DA-independent in vivo gene expression
(251 genes) for peritoneal macrophages from control (n = 4 biological replicates),
MHD3KO (n = 4 biological replicates) and NSDAD (n = 3 biological replicates).
f, Heat map showing 177 LPS-downregulated, HDAC3-dependent differentially
expressed genes in vivo that were not rescued by NSDAD (DA-dependent genes:
n = 4 biological replicates, except for KO vehicle and NSDAD LPS with n = 3
biological replicates; DE cutoff: |log₂FC| > 1, two-sided Benjamini-Hochberg-adjusted
FDR < 0.05 as determined by edgeR likelihood ratio test).
g, Empirical CDF and associated two-sided Kolmogorov-Smirnov test D
statistics of LPS-stimulated changes in statistically modelled DA-dependent
in vivo gene expression (404 genes) for peritoneal macrophages from control
(n = 4 biological replicates), MHD3KO (n = 4 biological replicates) and NSDAD
(n = 3 biological replicates). h, Gene set enrichment analysis (GSEA) for the
in vivo DA-dependent transcriptome (177 genes) showing enrichment of KEGG
Toll-like receptor signalling pathway that was more upregulated in peritoneal
macrophages from LPS-treated NSDAD mice (n = 3 biological replicates) than
from LPS-treated control mice (n = 4 biological replicates). Bar graph showing
Tlr4 gene expression level in normalized counts of peritoneal macrophages
from control and NSDAD treated with LPS. Data shown are mean ± s.d., two-sided
P value was calculated by Student’s t-test. i, j, Average density profiles in RPM of
HDAC3 ChIP-seq showing mean ± s.e.m. (n = 3 biological replicates) at
HDAC3-bound sites near 167 in vivo DA-independent genes (two-sided 
Wilcoxon’s P=1.7 x 10%) (i) or 165 DA-dependent genes (two-sided Wilcoxon’s 
P=0.153) (j) in peritoneal macrophages from vehicle- or LPS-treated control 
mice. 

Extended Data Fig. 6 | Dose-dependent effects of HDAC inhibitor SAHA on
endotoxin susceptibility. a, Kaplan-Meier curves of C57BL/6 mice subjected
to intraperitoneal injection of 10 mg kg⁻¹ purified LPS, with increasing doses
of SAHA (25, 100, 400 mg kg⁻¹). SAHA vehicle (veh), 10% DMSO in PBS. *P = 0.036,
**P = 0.001, ***P = 0.00065, n = 10 independent mice, calculated by two-sided
Mantel-Cox test. b, Heat map showing LPS-induced transcriptional changes as
a function of log₂(fold change) (log₂FC) of 179 in vivo DA-independent genes in
LPS-exposed BMDM treated with increasing dose of SAHA (25, 100, 400 nM,
n = 3 biological replicates) and peritoneal macrophages from LPS-exposed
MHD3KO (n = 4 biological replicates) and NSDAD mice (n = 3 biological
replicates). c, Average density profiles in RPM of HDAC3 ChIP-seq showing
means of 3 biological replicates at HDAC3-bound sites near 172
DA-independent genes. d, Genome-browser tracks showing 3 biologically
replicated examples of ChIP-seq peaks for HDAC3 in LPS-stimulated BMDM
treated with increasing dose of SAHA (25, 100, 400 nM) near DA-independent
genes. e, Average density profiles in RPM of HDAC3 ChIP-seq showing means
of 3 biological replicates at HDAC3-bound sites near 141 DA-dependent genes.
f, Genome-browser tracks showing 3 biologically replicated examples of
ChIP-seq peaks for HDAC3 in LPS-stimulated BMDM treated with increasing
dose of SAHA (25, 100, 400 nM) near DA-dependent genes.

x The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement
x A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly

Data collection Sequencing data, including RNA-seq, GRO-seq, and ChIP-seq, were prepared according to previously published and cited methodology
with detailed description of any modification in the Methods section, and generated with Illumina HiSeq2000, HiSeq4000, NextSeq500, or
NovaSeq 6000. Western blot images were scanned and analyzed with LI-COR Image Studio Lite (v5.2.5).


Data analysis GraphPad Prism (v8.0) and RStudio (v1.0.153) were used for graphing and statistical analysis. Genome-browser tracks were
generated with IGV (v2.3.92). R packages used in this study: edgeR (v3.26.8), latticeExtra (v0.6-29), pheatmap (v1.0.12), ggplot2 (v3.3.0).
STAR (v2.6.0a) was used for RNA-seq mapping. FeatureCounts (v1.5.3) was used for counting reads assigned to exons. GSEA (v4.0.3) was
used for gene-set enrichment analysis. Bowtie2 (v2.2.6) was used for ChIP-seq mapping. HOMER (v4.11) was used for ChIP-seq and GRO-seq
peak calling, motif analysis, as well as obtaining read density from processed bed files. Bedtools (v2.25.0) was used to intersect bed
files of genomic regions. TBA (v1.0) was used for machine learning-based motif prediction analysis. Gene ontology analyses were
performed with Enrichr (v2.1). Every step of the analyses was done according to the instructions provided with each package, with a detailed
description provided in the Methods section, without any custom code or algorithms.

Data


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability


All sequencing data, including RNA-seq, ChIP-seq, and GRO-seq were deposited to Gene Expression Omnibus under accession number GSE140611 with access code 
available to reviewers upon request. 

Life sciences study design


All studies must disclose on these points even when the disclosure is negative. 


Sample size At least 3 biological replicates were used for each experiment. In our experience, this number is the minimum required to account for the 
variability commonly observed in the murine bone-marrow derived macrophages, which is commonly accepted in the field. In cases where a 
large amount of input material or consistency of experimental conditions was required, as in the case of RNA-seq, GRO-seq and ChIP-seq, we 
had to pool materials from 3-10 mice, as detailed in the Methods section, for each biological replicate. For animal studies, 8-10 mice were used
for each experimental group, as detailed in the figure legends for each experiment as well as the Methods section. No calculation was performed
to pre-determine the sample size. 


Data exclusions For RNA-seq, genes with expression < 0.1 TPM were discarded as too lowly expressed. For ChIP-seq, all peaks have a required cut-off of 1 RPM. For
GRO-seq, we also discarded reads mapped to highly abundant transcripts such as rRNA, as these rarely represent true nascent transcription
of the targets of interest. Reads that mapped twice to the same regions were also removed, as they are likely artifacts of PCR amplification. All of
these exclusion criteria were pre-established as previously published (Fang et al., Cell 2014; Emmett et al., Nature 2017). All samples were
subjected to the same normalization pipeline to minimize variability in sequencing depth. We did not otherwise discard any data from any 
other experiments in this study. 
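
The expression and peak cut-offs stated above can be sketched as simple filters. This is a minimal illustration only: the function names and the input layout (a gene-to-TPM dictionary and a list of peak records with an "rpm" field) are hypothetical, and treating the 1 RPM cut-off as inclusive is an assumption.

```python
def filter_expressed_genes(tpm_by_gene, min_tpm=0.1):
    """Discard genes below min_tpm (genes < 0.1 TPM are treated as too lowly expressed)."""
    return {gene: tpm for gene, tpm in tpm_by_gene.items() if tpm >= min_tpm}


def filter_peaks(peaks, min_rpm=1.0):
    """Keep peaks whose signal meets the RPM cut-off (assumed inclusive)."""
    return [peak for peak in peaks if peak["rpm"] >= min_rpm]
```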


Replication All experiments were performed in at least 3 independent biological replicates, and repeated at least one more time independently except for 
the case of RNA-seq, ChIP-seq and GRO-seq, where we only profiled the independent biological replicates once due to cost and material 
constraints. However, we did cross-compare our data with publicly available datasets from other labs and observed significant overlaps. 


Randomization For in vivo studies, we randomized mice of different genetic backgrounds as well as treatment conditions into different groups to account for
variability in treatment exposure and stress level in each group. 


Blinding The author was blinded from the identities of mice used in the in vivo experiments by an independent lab member until final mortality census 
of the animals. For preparation of high-throughput sequencing libraries, the author processed the samples in a random order without blinding 
as all samples were subjected to the same library preparation procedure, and hence blinding at this step was irrelevant. For each study, all 
samples were processed at the same time to reduce bias and batch effect. 

Antibodies


Antibodies used List of antibodies used can also be found in Supplementary Table 1. HDAC3 (Abcam, monoclonal 7030, lot# GR121157-6, 1:200
dilution for IP, 1:1000 dilution for WB), p65 (Cell Signaling, monoclonal D14E12, lot# 13, 1:200 dilution for IP, 1:1000 dilution for
WB), Vinculin (Sigma, monoclonal V9131, lot# 128M4852V, 1:2000 dilution for WB), H3K27Ac (Abcam 4729, lot# GR251958-1,
1:200 dilution for IP, 1:1000 dilution for WB), ATF2 (Cell Signaling, D4L2X, lot# 1, 1:200 dilution for IP, 1:1000 dilution for WB),
ATF3 (Cell Signaling, D2Y5W, lot# 1, 1:200 dilution for IP, 1:1000 dilution for WB), NCoR1 (Cell Signaling, E4S4N, lot# 1, 1:200
dilution for IP), NCoR2 (Millipore Anti-SMRTe 06-891, lot# 2982400, 1:200 dilution for IP), Histone H3 (Abcam, 24834, lot#
GR236539, 1:1000 dilution for WB).


Validation HDAC3 (Abcam 7030) was validated to work for both western blot (WB) and chromatin immunoprecipitation (ChIP) in mouse
cell lines as detailed on the manufacturer's website (https://www.abcam.com/hdac3-antibody-chip-grade-ab7030.html), as well
as previously published for WB and ChIP in mouse tissues from our lab such as BAT (Emmett et al., Nature 2017), pancreas
(Remsberg et al., Mol Metabolism 2017) and liver (Armour et al., Nat Comm 2017).

p65 (Cell Signaling D14E12) was validated to work with WB in multiple human and mouse cell lines and ChIP in HeLa cells as
detailed on the manufacturer's website (https://www.cellsignal.com/products/primary-antibodies/nf-kb-p65-d14e12-xp-rabbit-mab/8242).

ATF2 (Cell Signaling D4L2X) was validated to work in human cell lines for WB and ChIP as detailed on the manufacturer's website
(https://www.cellsignal.com/products/primary-antibodies/atf-2-d412x-xp-rabbit-mab/35031), and was reported to work with
the mouse RAW264 cell line for WB (Kim et al., BMC Complementary and Alternative Medicine 2019).

ATF3 (Cell Signaling, D2Y5W) was validated to work in human cell lines for WB and ChIP as detailed on the manufacturer's
website (https://www.cellsignal.com/products/primary-antibodies/atf-3-d2y5w-rabbit-mab/33593), and was reported to work
with a human breast cancer cell line for WB (Subbaramaiah et al., JBC 2019).

NCoR1 (Cell Signaling, E4S4N) was validated to work in the LS180 human cell line for ChIP as detailed on the manufacturer's website
(https://www.cellsignal.com/products/primary-antibodies/ncor1-e4s4n-rabbit-mab/34271).

NCoR2 (Millipore Anti-SMRTe 06-891) was validated to work in the human HeLa cell line for ChIP as detailed on the manufacturer's
website (https://www.emdmillipore.com/US/en/product/Anti-SMRTe-Antibody,MM_NF-06-891#documentation).

H3K27Ac (Abcam 4729) was validated to work in the mouse MEF cell line for WB and the human HeLa cell line for ChIP as detailed on the
manufacturer's website (https://www.abcam.com/histone-h3-acetyl-k27-antibody-chip-grade-ab4729.html), as well as
previously published for WB and ChIP in mouse tissues from our lab such as BAT (Emmett et al., Nature 2017).

Histone H3 (Abcam, 24834) was reported to work in the human HeLa cell line, and mouse embryonic fibroblast and testis cell lines, for WB as detailed
on the manufacturer's website (https://www.abcam.com/histone-h3-antibody-mabcam-24834-nuclear-loading-control-and-chip-grade-ab24834.html),
as well as previously published for WB and ChIP in mouse tissues such as BAT (Emmett et al., Nature 2017).

Vinculin (Sigma, V9131) was reported to work for WB in multiple human and mouse cell lines as detailed on the manufacturer's
website (https://www.sigmaaldrich.com/catalog/product/sigma/v9131?lang=en&region=US), as well as previously published for
WB in mouse tissues from our lab such as BAT (Emmett et al., Nature 2017), liver and embryonic stem cells (Adlanmerini et al.,
PNAS 2019, Dierickx et al., 2019).

Cell line source(s) Primary murine bone-marrow derived macrophages, HEK-293FT from Invitrogen (R70007), L929 (Sigma, 8501).
Mycoplasma contamination We do not test for mycoplasma as we mostly work with murine-derived primary tissues that do not have high passage
number.
(See ICLAC register)


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 
weeks old were used in all experiments. Age-matched WT C57BL/6 mice were purchased from the Jackson Laboratory.


Wild animals Study did not involve any wild animals.
Field-collected samples No samples were collected from the field.
Ethics oversight Animal care and use procedures followed the guidelines of the Institutional Animal Care and Use Committee of the University of
Pennsylvania in accordance with the guidelines of the National Institutes of Health. All procedures were performed according to
institutional pain class C protocol.


Note that full information on the approval of the study protocol must also be provided in the manuscript. 

The majority of therapies that target individual proteins rely on specific
activity-modulating interactions with the target protein, for example, enzyme inhibition or
ligand blocking. However, several major classes of therapeutically relevant proteins
have unknown or inaccessible activity profiles and so cannot be targeted by such
strategies. Protein-degradation platforms such as proteolysis-targeting chimaeras
(PROTACs) and others (for example, dTAGs, Trim-Away, chaperone-mediated
autophagy targeting and SNIPERs) have been developed for proteins that are
typically difficult to target; however, these methods involve the manipulation of
intracellular protein degradation machinery and are therefore fundamentally limited
to proteins that contain cytosolic domains to which ligands can bind and recruit the
requisite cellular components. Extracellular and membrane-associated proteins, the
products of 40% of all protein-encoding genes, are key agents in cancer,
ageing-related diseases and autoimmune disorders, and so a general strategy to
selectively degrade these proteins has the potential to improve human health. Here we
establish the targeted degradation of extracellular and membrane-associated proteins 
using conjugates that bind both a cell-surface lysosome-shuttling receptor and the 
extracellular domain of a target protein. These initial lysosome-targeting chimaeras, 
which we term LYTACs, consist of a small molecule or antibody fused to chemically 
synthesized glycopeptide ligands that are agonists of the cation-independent 
mannose-6-phosphate receptor (CI-M6PR). We use LYTACs to develop a CRISPR 
interference screen that reveals the biochemical pathway for CI-M6PR-mediated cargo 
internalization in cell lines, and uncover the exocyst complex as a previously 
unidentified—but essential—component of this pathway. We demonstrate the scope of 
this platform through the degradation of therapeutically relevant proteins, including
apolipoprotein E4, epidermal growth factor receptor, CD71 and programmed
death-ligand 1. Our results establish a modular strategy for directing secreted and 
membrane proteins for lysosomal degradation, with broad implications for 
biochemical research and for therapeutics. 


Unlike the proteasomal pathway, the lysosomal pathway for protein 
degradation is not limited to proteins that have intracellular domains. 
Families of cell-surface lysosome-targeting receptors (LTRs) have 
been reported that facilitate the transport of proteins to lysosomes.
We proposed that chimeric molecules capable of binding both a 
cell-surface LTR and an extracellular protein might induce internali- 
zation and lysosomal degradation of the target, offering a means to 
accelerate the degradation of proteins by using binders that act in the
extracellular space. The prototypical LTR is the cation-independent 
mannose-6-phosphate receptor (CI-M6PR, also called IGF2R), which 
endogenously transports proteins bearing N-glycans capped with 
mannose-6-phosphate (M6P) residues to lysosomes”. The receptor 
cycles constitutively between endosomes, the cell surface and the 
Golgi complex. CI-M6PR shuttles cargo efficiently to prelysosomal
compartments, where a lowered pH enables cargo to dissociate and
progress to the lysosome while CI-M6PR is recycled. On the basis of its 
ability to efficiently deliver proteins to lysosomes and its expression 
in a range of different tissue types, CI-M6PR has been exploited to
deliver therapeutic enzymes for the treatment of lysosomal storage
disorders. Here we present lysosome-targeting chimaeras, which
we term LYTACs (Fig. 1a), that enable the depletion of secreted and
membrane-associated proteins via a mechanism of action that is
orthogonal and complementary to that of existing technologies,
and which does not rely on an extracellular protease for efficacy.
We show that LYTACs can serve as biochemical probes for the study 
of receptor trafficking and protein degradation, and can mediate the 
degradation of both secreted and membrane-associated proteins of 
therapeutic interest. 


Department of Chemistry, Stanford University, Stanford, CA, USA. Howard Hughes Medical Institute, Stanford, CA, USA. e-mail: bertozzi@stanford.edu
Fig. 1 | LYTACs using CI-M6PR traffic proteins to lysosomes. a, The concept
of LYTACs, in which a glycopolypeptide ligand for CI-M6PR is conjugated to an 
antibody to traffic secreted and membrane-associated proteins to lysosomes. 
b, Synthesis of M6Pn glycopolypeptide ligands for CI-M6PR. bpy, bipyridyl; 
COD, cyclooctadiene; RT, room temperature; THF, tetrahydrofuran; TMS, 
trimethylsilyl. c, Assay for the internalization of NA-647 by biotin-based 
LYTACs. d, Panel of synthetic M6P and M6Pn glycopolypeptides and controls. 
L, long; S, short. e, Fold changes in mean fluorescence intensity (MFI) for K562 
cells incubated at 37 °C for 1 h with 500 nM NA-647 or 500 nM NA-647 and 2 µM


To develop ligands for CI-M6PR, we leveraged precedents aimed at 
enhancing lysosomal enzyme replacement therapies and drug deliv- 
ery platforms. This previous work included biosynthetic engineering 
of M6P-bearing glycans as well as chemical synthesis of oligomeric
M6P-containing scaffolds. A recurring design parameter from these
studies was the requirement for a multivalent ligand, in order to achieve
optimal CI-M6PR agonism. Previous work had also revealed that the
6-phosphoester could undergo hydrolysis in human serum, leading
to rapid endocytosis by macrophages bearing mannose receptors.
We reasoned that N-carboxyanhydride (NCA)-derived glycopolypeptides
bearing multiple serine-O-mannose-6-phosphonate (M6Pn)
residues would enable multivalent presentation on a biocompatible,
phosphatase-inert and modular scaffold. We synthesized M6Pn glycopolypeptides
starting with the conversion of mannose pentaacetate
to M6Pn-NCA in 13 steps (Fig. 1b, Extended Data Fig. 1a). Subsequent
copolymerization of M6Pn-NCA and alanine-NCA (1:1 ratio, for the purpose
of spacing and polymerization kinetics) provided access to M6Pn
glycopolypeptides (post-deprotection dispersity, Đ = 1.3–1.5) of various
lengths, including short (20 M6Pn) and long (90 M6Pn) variants, to test 
in protein degradation assays. We also synthesized the corresponding 
M6P-containing copolymers bearing the natural phosphorylated gly- 
can structure (Extended Data Fig. 1b), with similar molecular weights 
and dispersities to those obtained with M6Pn monomers. 
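
The dispersity quoted above is the ratio of the weight-average to the number-average molar mass, Đ = Mw/Mn. The following is a minimal sketch of that calculation; the chain masses are hypothetical values chosen only for illustration and are not data from this work.

```python
def dispersity(chain_masses):
    """Return (Mn, Mw, D) for a list of individual chain molar masses (g/mol)."""
    if not chain_masses:
        raise ValueError("at least one chain mass is required")
    total = sum(chain_masses)
    # Number-average molar mass: arithmetic mean over chains.
    mn = total / len(chain_masses)
    # Weight-average molar mass: each chain weighted by its own mass.
    mw = sum(m * m for m in chain_masses) / total
    return mn, mw, mw / mn


# Hypothetical chain masses (g/mol), for illustration only.
sample = [8000, 10000, 12000, 15000, 20000]
mn, mw, d = dispersity(sample)
print(f"Mn = {mn:.0f} g/mol, Mw = {mw:.0f} g/mol, dispersity = {d:.2f}")
```

A perfectly uniform sample gives Đ = 1; broader chain-length distributions give larger values.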

To demonstrate the feasibility of CI-M6PR-driven LYTACs, we 
designed an in cellulo assay to measure the uptake of NeutrAvidin-647
(NA-647), an Alexa Fluor-647 (AF647)-labelled protein to which bio- 
tin binds (Fig. 1c). We synthesized and biotinylated poly(M6P) and 
biotinylated glycopolypeptide in complete growth media. MFI was determined
by live-cell flow cytometry. f, Live-cell confocal microscopy images of K562
cells treated as in e, then labelled with LysoTracker Green for 30 min. Scale bar,
10 µm. g, Panel of cell lines for NA-647 uptake experiments, performed as in e.
For e, g, data are mean ± s.d. of three independent experiments. For f, images
are representative of two independent experiments. P values were determined
by unpaired two-tailed t-tests; fold changes are reported relative to incubation
with protein targets alone. NS, not significant.


poly(M6Pn) polypeptides, as well as poly(N-acetylgalactosamine) 
(poly(GalNAc)) and poly(mannose) as controls (Fig. 1d, Extended Data 
Figs. 1c, d, 2a). K562 cells were incubated with either NA-647 or NA-647 
and biotinylated glycopolypeptide for one hour, then washed and 
analysed by flow cytometry. Co-incubation with M6P and M6Pn polypeptides
increased cellular fluorescence by 5–6-fold compared with the
background, with only minor differences in uptake efficiency observed
as a result of glycopolypeptide length (short versus long, Fig. 1e). M6Pn
polypeptides showed performance that was equivalent or superior to
that of M6P polypeptides of similar length, whereas incubation with
mannose- or GalNAc-containing glycopolypeptides showed no marked
change in fluorescence compared with the background. NA-647 uptake
mediated by poly(M6Pn) was attenuated by co-incubation with excess
exogenous M6P. Furthermore, uptake remained continuous over time
(Extended Data Fig. 2b), suggesting that surface-receptor recycling was
the rate-limiting step. AF647 is reported to be stable in endosomes
and lysosomes, with a steadily increasing fluorescent signal arising
from intracellular fluorophore accumulation. Live-cell fluorescence
microscopy experiments revealed that the AF647 signal co-localized
with acidic endosomes and lysosomes after only 1 h (Fig. 1f, Extended
Data Figs. 2c, d). Finally, biotinylated LYTACs mediated NA-647 uptake 
in various cell lines, demonstrating the breadth of CI-M6PR-targeting 
(Fig. 1g) as well as the ability to use small molecules as LYTAC ‘warheads’. 
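
Uptake in these assays is reported as a fold change in mean fluorescence intensity (MFI) relative to cells incubated with the protein target alone, summarized as mean ± s.d. over independent experiments. The sketch below illustrates one way to compute such a summary; pairing treated and control values within each experiment is an assumption, and the MFI numbers are hypothetical.

```python
from statistics import mean, stdev


def fold_changes(treated_mfi, control_mfi):
    """Per-experiment fold change of treated over control MFI.

    Values are paired by experiment (an assumption of this sketch).
    """
    if len(treated_mfi) != len(control_mfi):
        raise ValueError("treated and control lists must have equal length")
    return [t / c for t, c in zip(treated_mfi, control_mfi)]


# Hypothetical MFI values from three independent experiments.
control = [100.0, 110.0, 95.0]   # protein target alone
treated = [550.0, 600.0, 540.0]  # protein target plus glycopolypeptide

fc = fold_changes(treated, control)
print(f"fold change = {mean(fc):.1f} +/- {stdev(fc):.1f} (mean +/- s.d., n = {len(fc)})")
```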

We next performed a CRISPR interference (CRISPRi) pooled genetic 
screen with the aim of identifying genes for which knockdown ablates 
the delivery of NA-647 by biotinylated LYTACs (Fig. 2a). K562 cells
expressing dCas9-KRAB were transduced with a genome-wide library
Fig. 2 | CRISPRi screen identifies key cellular machinery for LYTACs.
a, Schematic of a CRISPRi screen in K562 cells stably expressing dCas9-KRAB
and a library of sgRNAs with genome-wide coverage. b, Selected gene hits for
regulation of NA-647 internalization by LYTACs. c, Gene ontology (GO)
annotation for significant hits (<5% FDR). d, e, Cell surface expression levels of
CI-M6PR in dCas9-KRAB K562 cells (d) or dCas9-KRAB HeLa cells (e)
transfected with control sgRNA, IGF2R-targeting sgRNA, EXOC1-targeting
sgRNA or EXOC2-targeting sgRNA. Cells were stained for CI-M6PR and


of CRISPRi single guide RNAs (sgRNAs) and incubated with LYTAC and 
NA-647. A population of cells that exhibited a substantial decrease in 
NA-647 labelling was isolated by fluorescence-activated cell sorting 
(FACS), and next-generation sequencing was performed to identify 
sgRNAs that were overrepresented in this population. Guide RNAs to 
IGF2R (which encodes CI-M6PR) were highly enriched in the sorted 
pool, providing unbiased confirmation of selective receptor targeting 
(Fig. 2b; for the full dataset, see Supplementary Information). Other 
significantly enriched guide RNAs (false discovery rate (FDR) < 5%) 
targeted genes that regulate endosomal acidification, vesicle traffick- 
ing, endosome-lysosome fusion and clathrin-dependent endocytosis 
(Fig. 2c), consistent with a CI-M6PR-driven endocytic pathway to lysosomes.
Components of the exocyst complex, which is reported
to mediate vesicle localization to the plasma membrane, were also
identified as essential for the M6Pn-mediated internalization of NA-647.
We proposed that the exocyst complex could be involved in the presentation
of CI-M6PR on the cell surface, a biochemical pathway which
is yet to be elucidated. We performed CRISPRi knockdown of EXOC1
and EXOC2 (which encode the exocyst complex components 1 and 2, 
respectively) in K562 cells using CRISPRi library guides, and meas- 
ured cell-surface CI-M6PR levels by flow cytometry. A 60% decrease 
in cell-surface CI-M6PR levels was observed (Fig. 2d). We replicated 
these results in HeLa cells (75% decrease, Fig. 2e), and found that total 
CI-M6PR levels did not change upon knockdown of EXOC1 in either 


subjected to live-cell flow cytometry. f, g, Western blot analysis of EXOC1 and
CI-M6PR in K562 (f) and HeLa (g) CRISPRi knockdown lines. For c, analysis is
representative of two replicates; d, f, g are representative of two independent
experiments. K562 cells rapidly lost the exocyst knockdown phenotype after
two passages. For e, data are mean ± s.d. of three independent experiments in
HeLa and normalized to non-targeting sgRNA. P values were determined by
unpaired two-tailed t-tests.


cell line (Fig. 2f, g, respectively). Further, we observed no change in 
surface expression of epidermal growth factor receptor (EGFR) on 
HeLa cells (Extended Data Fig. 3), indicating that the global traffick- 
ing of cell-surface proteins remained unperturbed. Together these 
data indicate that the surface presentation of CI-M6PR is mediated 
in part by the exocyst complex, and—more broadly—that LYTACs can 
be used to study the molecular pathways involved in the regulation of 
cell-surface receptors. 

We next sought to determine whether conjugation of a 
poly(M6Pn)-bearing glycopolypeptide to an antibody would repro- 
gramme the antibody to rapidly direct extracellular agents to the 
lysosome. As a proof-of-principle experiment, we non-specifically 
labelled lysine residues on a polyclonal anti-mouse IgG with 
bicyclononyne-N-hydroxysuccinimide (BCN-NHS), and subsequently 
conjugated this antibody to azide-terminated M6Pn glycopolypep- 
tides via copper-free strain-promoted azide-alkyne cycloaddition 
(Fig. 3a), generating the LYTAC Ab-1. The conjugation reaction could 
be readily monitored by native gel electrophoresis, because the cova- 
lent attachment of the anionic polypeptides caused a characteristic 
enhancement in migration compared with antibodies conjugated with 
non-charged polypeptides (that is, poly(GalNAc)) (Fig. 3b). Incubation 
of K562 cells with Ab-1 and a mouse IgG labelled with Alexa Fluor-488 
(AF488) (Extended Data Fig. 4a) resulted in a 40-fold increase in lysosomal
AF488 signal relative to that of controls (Extended Data Fig. 4b, c).
Fig. 3 | LYTACs target soluble proteins to lysosomes for degradation.
a, Synthetic scheme for antibody-based LYTACs with a goat anti-mouse IgG.
b, Native gel (with Coomassie staining) of polyclonal goat anti-mouse IgG
conjugates. c–f, Uptake of mCherry antigen and its antibody by a LYTAC, into
K562 cells (d), HEK-293 cells (e) or Jurkat cells (f) with 50 nM mCherry, and 25 nM
mouse anti-mCherry or 25 nM mouse anti-mCherry with 25 nM goat anti-mouse
or conjugates after 1 h at 37 °C. Mean fluorescence intensity (MFI) was measured
by live-cell flow cytometry. g, Uptake of ApoE4-647 into K562 cells; experiments
were performed analogously to d. Median fluorescence intensity (MFI) was
measured by live-cell flow cytometry. h, Cells were incubated as in g for 8 h in the
presence or absence of 0.1 mg ml⁻¹ leupeptin (LPT), then lysed. Lysates were
separated by SDS-PAGE, and ApoE4-647 was detected via in-gel fluorescence.
Results are representative of two (h) or three (b) independent experiments. Data
are mean ± s.d. of three independent experiments (d–g). P values were
determined by unpaired two-tailed t-tests; fold changes are reported relative to
incubation with protein targets alone.


Given that Ab-1 efficiently trafficked IgG molecules to lysosomes, we
reasoned that it could function as a LYTAC for a primary IgG antibody
bound to its antigen (Fig. 3c). Co-incubation of cells with mCherry,
mouse anti-mCherry and Ab-1 resulted in increases in uptake of
between 10-fold and 100-fold relative to non-M6Pn-bearing antibodies
in cell lines (Fig. 3d–f). We next aimed to expand this strategy to a clinically
relevant target, selecting apolipoprotein E4 (ApoE4), which has
been implicated in the pathogenesis of neurodegenerative disease.
A 13-fold increase in uptake of ApoE4-647 was observed using Ab-1
and a mouse anti-ApoE4 primary antibody (Fig. 3g), and uptake was
continuous over the course of the incubation (Extended Data Fig. 4d).
Co-localization of the AF647 signal with lysosomes was observed at
both 1 h and 24 h (Extended Data Fig. 4g), replicating our findings with
NeutrAvidin and IgG. Notably, non-specific inhibition of serine and
cysteine proteases resulted in the accumulation of intracellular ApoE4-647,
and this accumulation was substantially increased in the presence
of anti-ApoE4 with Ab-1 (Fig. 3h, Extended Data Fig. 4e, f). These data 
demonstrate that LYTAC-mediated target enrichment in the lysosome 
is coupled with target degradation. More broadly, conjugation of a
lysosome-targeting ligand to an antibody can reprogramme the anti- 
body to direct an extracellular antigen for degradation. 

We next asked whether LYTACs could be used to accelerate the deg- 
radation of membrane-bound extracellular proteins. In principle, this 
requires simultaneous binding of a surface-associated protein and 
engagement of CI-M6PR, an interaction which would also benefit 
from increased effective molarity upon target binding (Fig. 4a). We 
first targeted EGFR, a known driver of cancer proliferation that can 
perform multiple scaffolding functions regardless of inhibition of 
its receptor tyrosine kinase activity. LYTACs were constructed using 
cetuximab (ctx)—an EGFR-blocking antibody approved by the Food and 
Drug Administration (FDA)—using a similar scheme to Ab-1 (Extended 
Data Fig. 5a). HeLa cells were incubated for 24 h with 100 nM ctx, ctx 
functionalized with either long or short M6Pn glycopolypeptides, 
or GalNAc-functionalized ctx as a control. Cells were then lysed and 
assayed for total EGFR levels (Extended Data Fig. 5b). Substantial deg- 
radation of EGFR was observed only with ctx conjugates bearing M6Pn 
glycopolypeptides (greater than 70% degradation, comparable to 
EGF-induced downregulation), while no changes in CI-M6PR levels 
were observed. There was no apparent difference in degradation abil- 
ity between LYTACs bearing short or long M6Pn glycopolypeptides. 
To demonstrate that depletion of EGFR is mediated by CI-M6PR, we 
compared the degradation performance of ctx-M6Pn Ab-2 (Extended 
Data Fig. 5c) to that of unmodified ctx in dCas9-KRAB HeLa lines stably
expressing a non-targeting or an IGF2R-targeting sgRNA. Degradation
of EGFR was observed only in HeLa cells that expressed non-targeting
sgRNA, whereas knockdown of IGF2R completely abrogated the degradation
mediated by Ab-2 (Fig. 4b). Equivalent results were obtained with
concentrations of Ab-2 of between 100 nM and 10 nM (Extended Data
Fig. 5e). EGFR degradation was observed after 3 h and reached a maximum
between 12 h and 24 h, persisting for at least 72 h (Extended Data
Fig. 5f, g). Exogenous M6P inhibited the degradation of EGFR (Fig. 4c)
in the presence of Ab-2, and treatment of cells with chloroquine
likewise prevented Ab-2-promoted degradation, demonstrating a dependency
both on M6P binding and on lysosomal acidification. To rule out the
possibility that EGFR crosslinking by ctx contributes to target downregulation,
we digested ctx with papain to obtain ctx-Fab (which
binds EGFR in a monovalent fashion) and conjugated it to poly(M6Pn)
to generate Fab-1 (Extended Data Fig. 5d). Fab-1 accelerated the degradation
of EGFR, in a CI-M6PR-dependent manner, to levels comparable
with those obtained using Ab-2 (Fig. 4d); this demonstrates
that monovalent target binders can be used as LYTACs. In a mixed cell
assay with HeLa (CI-M6PR⁺EGFR⁺) and Jurkat (CI-M6PR⁺EGFR⁻) cells,
the ctx-M6Pn conjugate exhibited a similar binding profile to that of 
ctx alone, suggesting that LYTACs with dual affinities for the target 
and CI-M6PR can maintain high on-target specificity (Extended Data 
Fig. 6). EGFR degradation was also observed in breast cancer (BT474 
and MDA-MB-361) and hepatocellular carcinoma (Hep3B and HepG2) 
cell lines (Fig. 4h, Extended Data Fig. 7a). 
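
The degradation percentages quoted above are derived from band intensities measured by densitometry and normalized to a loading control (see the Fig. 4 legend). A minimal sketch of that arithmetic follows; the function name and the intensity values are hypothetical, and the exact quantification workflow used here may differ.

```python
def percent_degradation(target_treated, loading_treated,
                        target_control, loading_control):
    """Percent loss of target protein relative to an untreated control lane.

    Each target band is first divided by the loading-control band from the
    same lane, then the treated lane is compared with the control lane.
    """
    if min(loading_treated, loading_control, target_control) <= 0:
        raise ValueError("loading and control intensities must be positive")
    treated_norm = target_treated / loading_treated
    control_norm = target_control / loading_control
    return 100.0 * (1.0 - treated_norm / control_norm)


# Hypothetical densitometry readings (arbitrary units).
loss = percent_degradation(
    target_treated=300.0, loading_treated=1000.0,
    target_control=1200.0, loading_control=1000.0,
)
print(f"degradation = {loss:.0f}%")
```

Normalizing within each lane before comparing lanes corrects for unequal sample loading, which would otherwise be mistaken for a change in target abundance.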

Next, we used quantitative mass spectrometry to perform a 
proteome-wide characterization of LYTAC degradation specific- 
ity in cells treated with ctx or Ab-2. A significant reduction of EGFR 
levels was promoted by Ab-2, whereas ctx did not affect EGFR levels 
(Fig. 4e). CI-M6PR levels were unchanged, further supporting that 
the degradation of EGFR is catalytic in terms of CI-M6PR. Changes in 
the expression levels of other proteins as a result of EGFR degrada- 
tion were also observed (Extended Data Table 1; for the full dataset, 
see Supplementary Information). These co-degraded proteins may 
represent EGFR interaction partners or co-regulated molecules. For 
example, treatment with Ab-2 caused significant downregulation 
of dihydropteridine reductase, 7,8-dihydro-8-oxoguanine triphosphatase
and transcription factor DP-1, and resulted in the upregulation
of glucose-1,6-bisphosphate synthase and coordinator of PRMT5 and
differentiation stimulator (COPRS). Although the precise mechanism 
Fig. 4 | LYTACs accelerate degradation of membrane proteins. a, Schematic
of EGFR degradation using LYTACs. b, EGFR levels after treatment with 10 nM
Ab-2 for 24 h in dCas9-KRAB HeLa cells expressing a control sgRNA or sgRNA
targeting IGF2R. c, EGFR levels after treatment with 10 nM Ab-2 for 48 h in
dCas9-KRAB HeLa cells expressing a control sgRNA in the presence of 5 mM
mannose-6-phosphate (M6P) or 200 µM chloroquine. d, EGFR levels after
treatment with Fab-1 in dCas9-KRAB HeLa cells expressing a control sgRNA or
sgRNA targeting IGF2R. e, Fold change in the abundance of 3,877 HeLa proteins
detected by quantitative proteomics analysis after 24-h treatment with either
10 nM Ab-2 (left) or ctx (right) relative to untreated cells; data are the mean of
three biological replicates. f, Cellular localization of EGFR after treatment with
ctx or Ab-2 for 48 h. Scale bar, 20 µm. g, Levels of pEGFR in dCas9-KRAB HeLa
cells expressing a control sgRNA after 24-h incubation with 10 nM ctx or Ab-2,
then incubation with EGF for 10 min or 60 min. h, Levels of EGFR after treatment
with 10 nM Ab-2 in Hep3B cells for 48 h. i, Levels of pEGFR, pAkt and pERK1/2 in
Hep3B cells after 48 h incubation with 10 nM ctx or Ab-2, then incubation with
EGF for 30 min. j, Degradation of CD71 mediated by a primary antibody and Ab-1.
k, CD71 levels after treatment with 50 nM anti-CD71 or anti-CD71 and Ab-1 in
Jurkat cells after 24 h. l, Uptake of transferrin-647 in Jurkat cells treated with
anti-CD71 or anti-CD71 and Ab-1 for 24 h. m, PD-L1 levels in HDLM-2 cells after
treatment with 25 nM atezolizumab (atz) or atz-LYTAC for 48 h. n, Levels of
PD-L1 degradation with 25 nM atz-LYTAC after 48 h in HDLM-2 cells in the
presence of 5 mM M6P. o, Serum levels of ctx or ctx-LYTAC in BALB/c mice
injected intraperitoneally at 5 mg kg⁻¹. p, Quantification of serum ctx-M6Pn
relative to ctx after intraperitoneal injection. Data are representative of
two (c, d, f, g, l, n) or three (b, h, i, k, o) independent experiments or mice (o).
Data are mean ± s.d. (m) or mean ± s.e.m. (p) of three independent experiments
(m, one of which is shown in n) or mice (p). P values were determined by
unpaired two-tailed t-tests. Relative values were calculated via densitometry
and normalized to loading control (b–d, h, k, m, n) or relative to ctx levels (p).
for these changes remains to be elucidated, these data suggest that
LYTACs may provide a means to monitor cellular changes in response 
to targeted protein degradation. 

We compared the functional consequences of LYTAC-mediated 
degradation to those of EGFR inhibition by ctx. Immunofluorescent 
microscopy revealed marked relocalization of EGFR from the plasma 
membrane to intracellular vesicles upon treatment with Ab-2 compared 
to treatment with ctx alone (Fig. 4f). To assess the consequences of 
degradation and removal from the plasma membrane, we incubated 
cells with ctx or Ab-2, then washed and treated them with EGF. In cells 
left untreated by ctx or Ab-2, we observed rapid phosphorylation of 
Tyr1068 of EGFR that decreased over time; this is consistent with
EGF-induced downregulation of EGFR (Fig. 4g). Cells treated with ctx 
exhibited modest levels of phosphorylation that were sustained for 
the course of the EGF incubation. The sustained levels of phosphoryla- 
tion are consistent with the ability of EGFR to engage in autophospho- 
rylation and thus enhance stimulatory ligand-binding events through 
oligomerization. However, degradation of EGFR with Ab-2 reduced
phosphorylation, demonstrating the advantages of receptor removal. 
This effect was dependent on CI-M6PR (Extended Data Fig. 5h). Because 
downstream kinase phosphorylation is independent of EGFR phosphorylation
in non-starved HeLa cells, we used Hep3B hepatocellular
carcinoma cells. Ab-2 facilitated robust EGFR degradation in Hep3B 
cells (Fig. 4h), and this degradation desensitized cells to EGF stimula- 
tion (Fig. 4i). Notably, although ctx treatment inhibited EGFR phos- 
phorylation, levels of downstream phosphorylated Akt (pAkt) were 
similar between ctx and untreated cells. Only with Ab-2 treatment 
was downstream pAkt reduced to the levels observed in unstimulated 
cells. Proliferation of HepG2 cells is known to be stimulated by EGF.
We showed that Ab-2 inhibited the proliferative effects of EGF stimu- 
lation in HepG2 cells, relative to treatment with ctx alone (Extended 
Data Fig. 7a, b). 

To demonstrate the scope of the LYTAC platform, we targeted 
several additional membrane-associated proteins for degradation. CD71 
(transferrin receptor-1)—a therapeutic cancer target that is progressing 
towards clinical trials—is known to cycle between early endosomes and 
the cell surface, avoiding trafficking to the lysosome for degradation.
Upon incubation of Jurkat cells with a mouse anti-CD71 antibody and 
Ab-1 (Fig. 4j), degradation of CD71 was observed (greater than 80%; 
Fig. 4k), which was dependent on M6Pn recognition (Extended Data 
Fig. 8g). Note that the anti-CD71 antibody alone is reported to induce 
partial CD71 downregulation. After 24-h incubation, cells treated with
anti-CD71 and Ab-1 internalized less transferrin-647 than did untreated
cells or cells treated with non-M6Pn-bearing antibodies (Fig. 4l), further
highlighting the functional consequences of target degradation.

Additionally, we investigated whether LYTACs could degrade PD-L1, 
a driver of cancer cell immune evasion. For PD-L1 to undergo accelerated
degradation, the LYTAC must overcome the PD-L1 recycling pathway.
MDA-MB-231 cells, which are PD-L1-positive, were treated with
anti-PD-L1, anti-PD-L1 functionalized with GalNAc glycopolypeptides 
or anti-PD-L1-M6Pn LYTAC (Ab-3, Extended Data Fig. 8a). Treatment 
with Ab-3 resulted in a significant decrease (average 33%) in cell-surface
PD-L1 levels compared to treatment with unfunctionalized anti-PD-L1 
or anti-PD-L1-poly(GalNAc) (Extended Data Fig. 8b, c). As MDA-MB-231 
cells express low levels of CI-M6PR, we tested the degradation of PD-L1 
in the Hodgkin’s lymphoma cell line HDLM-2, which expresses higher 
levels of CI-M6PR. After 36 h of treatment with Ab-3, 50% downregu- 
lation of total PD-L1 was observed (Extended Data Fig. 8d, e). When 
treated with a LYTAC derived from atezolizumab (the active ingredient 
in Tecentriq) (Extended Data Fig. 8f)—an FDA-approved PD-L1-blocking 
antibody—we observed a greater extent of PD-L1 degradation (average 
70% degradation, Fig. 4m), which was dependent on M6Pn-recognition 
(Fig. 4n). Thus, LYTACs can override the endogenous recycling 
programs of cell-surface proteins and accelerate their lysosomal 
degradation. 
Finally, we asked how M6Pn conjugation might affect antibody clearance
in vivo. BALB/c mice were injected intraperitoneally with 5 mg kg⁻¹
ctx or ctx-M6Pn, and serum antibody levels were assayed by western
blotting. Six hours after injection, we observed a threefold reduction
in serum ctx-M6Pn relative to ctx, suggesting a rapid clearance phase
post-injection (Fig. 4o, p). However, between 6 h and 72 h we observed
only a moderate decrease in serum ctx-M6Pn relative to ctx, and no 
substantial accumulation in either the liver or the spleen (Extended 
Data Fig. 9). These data suggest two regimes of clearance: a rapid ini- 
tial phase and a slower, more sustained clearance. Modulating these 
two regimes will be important for tailoring the in vivo efficacy of any 
individual combination of LYTAC and target protein. 

In summary, the LYTAC platform enables targeting of extracellular
and membrane-associated proteins directly for lysosomal degradation.
The success of a particular LYTAC is likely to result from a combination
of factors—including the endogenous kinetics of protein trafficking and 
turnover, the amount of surface localization, the inherent susceptibility 
to lysosomal transportation through clathrin-mediated endocytosis 
and the stoichiometry relative to the lysosome-targeting receptor. 
Although the LYTACs described above take advantage of CI-M6PR, in 
principle other shuttling receptors could be co-opted, reducing the 
potential for resistance to develop when targeting oncoproteins. We 
have demonstrated that both small molecules and larger peptides can 
be used as protein target binders in LYTACs. Modulating the pharmacokinetic
properties to control off-target clearance and stoichiometries
of a given LYTAC necessary to effectively degrade a membrane protein
will remain a key challenge for further translational applications. We 
anticipate that the chemical tunability and modularity of LYTACs will 
offer new opportunities to manipulate these parameters for targeted 
protein degradation of secreted and membrane proteins, both for 
research and for potential therapeutics. 
Methods


Data reporting 

No statistical methods were used to predetermine sample size. The 
experiments were not randomized and the investigators were not 
blinded to allocation during experiments and outcome assessment. 


General synthetic chemistry procedures 

All reactions were performed in standard, dry glassware fitted with 
rubber septa under an inert atmosphere of nitrogen unless otherwise 
described. Reagents were purchased in reagent grade from com- 
mercial suppliers and used as received, unless otherwise described. 
Anhydrous solvents (acetonitrile, benzene, dichloromethane, diethyl 
ether, N,N-dimethylformamide, tetrahydrofuran and toluene) were 
prepared by passing the solvent through an activated alumina column. 
See the Supplementary Information for detailed synthetic procedures 
and characterization of all new compounds. 


Cell lines

Cells were grown in T75 flasks (Thermo Fisher) and maintained at 37 °C 
and 5% CO₂. K562 and Jurkat cells were grown in RPMI supplemented
with 10% fetal bovine serum (FBS) and 1% penicillin/streptomycin. 
THP-1 cells were grown in RPMI supplemented with 10% FBS, 1% penicil- 
lin/streptomycin, and 0.05 mM 2-mercaptoethanol. HeLa, MDA-MB-361, 
HEK-293, HEK-293T, Hep3B and MDA-MB-231 cells were grown in DMEM 
supplemented with 10% FBS and 1% penicillin/streptomycin. HepG2 
cells were grown in low-glucose DMEM supplemented with 10% FBS 
and 1% penicillin/streptomycin. BT474 cells were grown in RPMI supplemented
with 10% FBS and 1% penicillin/streptomycin with 0.023 U
ml⁻¹ insulin. HDLM-2 cells were grown in RPMI supplemented with 20%
heat-inactivated FBS and 1% penicillin/streptomycin. HeLa and K562 
cells stably expressing dCas9-KRAB were a gift from M. Bassik. Jurkat, 
HeLa, MDA-MB-361, MDA-MB-231, THP-1, K562, HEK-293, HEK-293T, 
Hep3B, HepG2 and BT474 cells were obtained from the American Type 
Culture Collection (ATCC); HDLM-2 cells were obtained from the laboratory
of T. Waldmann. Cell lines were authenticated by the supplier. Cell
lines were verified to be mycoplasma-negative using a Lonza MycoAlert
Mycoplasma Detection Assay. 


General procedure for NeutrAvidin-647 uptake experiments 
Cells (70% confluent in 24-well plate for adherent cells, 1 x 10° cells per 
ml for suspension cells) were incubated in complete growth medium 
supplemented with 500 nM NA-647, or NA-647 and 2 µM glycopolypeptide.
Addition was performed sequentially, with NA-647 added
first followed by glycopolypeptide and immediate transfer to 37 °C. 


General procedure for protein uptake and degradation 
experiments 

Cells (70% confluent in 24-well plate for adherent cells, 2 x 10° cells per 
ml for suspension cells) were incubated in complete growth medium 
supplemented with 50 nM target protein, or 50 nM target protein and 
25 nM of each antibody. Addition was performed sequentially, with 
target protein added first followed by secondary antibodies, followed 
by primary antibodies, and subsequent immediate transfer to 37 °C. 
Recombinant proteins were expressed in-house (mCherry) or pur- 
chased from Abcam (human apolipoprotein E4, ab50243) and labelled 
with commercially available fluorescent dyes. 


CRISPRi screen procedure 

The hCRISPRi-v2 library was a gift from J. Weissman (Addgene ID 83969).
K562 cells stably expressing dCas9-KRAB were infected with the 104,535
sgRNA library described previously, targeting the transcriptional
start sites of 18,905 annotated genes at a redundancy of 5 sgRNA per 
start site. Infectivity was titrated such as to transduce the library at a 
multiplicity of infection of 0.3-0.4. Cells were selected with puromycin 


for 96 h and then allowed to expand in puromycin-free media for 48 h.
Staining and FACS were performed within seven days of completion of
puromycin selection. On the day of sorting, two replicates of 50 × 10⁶
library-infected cells (4 × 10⁶ cells per ml) were incubated with NA-647
(500 nM) and poly(M6Pn)long (2 µM) for 1 h at 37 °C. Cells were washed
three times with PBS at 4 °C. Cells were then resuspended in PBS with 
SYTOX Green (according to the manufacturer’s specifications) and 
sorted by FACS using a BD Aria II. Intact, viable cells were selected by 
gating on FSC/SSC and SYTOX Green channels. A population of cells 
representing the bottom 15% of the fluorescence distribution in the 
NA-647 channel was then selected for and sorted. Sorting was conducted
until 25 × 10⁶ events had been processed, corresponding to
an approximately 250-fold library coverage. Sorted cells were then
pelleted and frozen for subsequent downstream processing. Aliquots
of 50 × 10⁶ unsorted cells from each replicate were also pelleted and
frozen down in parallel for normalization. 
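
The coverage and infection figures in this procedure can be checked with simple arithmetic. The sketch below assumes that the sorted event count is 25 × 10⁶ (the exponent is garbled in the extracted text, but this reading matches the stated coverage of approximately 250-fold) and that lentiviral integration follows a Poisson distribution, which is a common modelling assumption rather than something stated in the procedure.

```python
import math

LIBRARY_SIZE = 104_535      # sgRNAs in the library (from the text)
SORTED_EVENTS = 25_000_000  # events processed (assumed reading of the exponent)


def coverage(n_cells, library_size):
    """Average number of cells (or events) per sgRNA in the library."""
    return n_cells / library_size


def poisson_infection(moi):
    """Return (fraction of cells infected, fraction of infected cells
    carrying exactly one integration) under a Poisson model."""
    p_zero = math.exp(-moi)
    p_one = moi * math.exp(-moi)
    infected = 1.0 - p_zero
    return infected, p_one / infected


print(f"coverage of sorted events: {coverage(SORTED_EVENTS, LIBRARY_SIZE):.0f}-fold")
for moi in (0.3, 0.4):
    infected, single = poisson_infection(moi)
    print(f"MOI {moi}: {infected:.1%} of cells infected, "
          f"{single:.1%} of those carry a single sgRNA")
```

Under this model, a multiplicity of infection of 0.3 to 0.4 keeps most infected cells at one sgRNA each, which is what allows a sorted phenotype to be attributed to a single guide.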


CRISPRi screen DNA extraction and data analysis 

Frozen cell pellets were thawed and genomic DNA was extracted
using either the QIAamp DNA Blood Maxi Kit (for unsorted
samples) or the Sigma GenElute Mammalian Genomic DNA Miniprep
kit (for sorted samples) according to the manufacturer’s specifications.
The sgRNA-encoding regions were amplified via nested PCR
and sequenced on an Illumina NextSeq500. Alignment of sequencing
reads to the sgRNA library and statistical comparison of positively
selected genes in sorted versus unsorted samples were performed using
MaGeCK. MaGeCK returned a statistical score for gene enrichment
in the sorted population, and genes were ranked by a positive selection
score corresponding to the -log(PosScore). GO enrichment analysis
was performed using GOrilla.
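
The ranking step can be sketched in a few lines of Python. This is an illustration under stated assumptions rather than the authors' script: the logarithm is assumed to be base 10 (the text does not give the base), the positive-selection score is assumed to lie in (0, 1], and the gene names and scores are hypothetical placeholders.

```python
import math


def rank_by_positive_selection(pos_scores: dict[str, float]) -> list[tuple[str, float]]:
    """Return (gene, -log10(score)) pairs, strongest enrichment first."""
    ranked = []
    for gene, score in pos_scores.items():
        # Assumption: scores behave like probabilities in (0, 1].
        if not 0.0 < score <= 1.0:
            raise ValueError(f"score for {gene} must be in (0, 1], got {score}")
        ranked.append((gene, -math.log10(score)))
    ranked.sort(key=lambda pair: pair[1], reverse=True)
    return ranked


# Hypothetical scores for illustration only; not data from the screen.
example_scores = {"GENE_B": 0.05, "GENE_A": 1e-6, "GENE_C": 0.5}
ranking = rank_by_positive_selection(example_scores)
for gene, value in ranking:
    print(f"{gene}\t{value:.2f}")
```

A smaller positive score gives a larger transformed value, so the most strongly enriched genes appear at the top of the list.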


Generation of CRISPRi-knockdown cell lines 
K562 or HeLa cells stably expressing dCas9-KRAB were infected with
lentivirus encoding sgRNAs identified from the CRISPRi screen. Lentivirus
was produced by co-transfection of HEK-293T cells with a lentiviral
transfer vector and packaging plasmids (pGag/pol, pREV, pTAT, pVSVG).
Transfection was performed using Lipofectamine 2000 reagent as
recommended by the manufacturer. Viral supernatants were collected
48 h after transfection, filtered through a 0.45-µm filter, and added to
target cells (400 µl viral supernatant per well in a 6-well plate) along
with polybrene (8 µg per ml). Then, 48 h post infection, cell media was
replaced and cells were placed under puromycin selection (2 µg ml⁻¹).
sgRNA guides: IGF2R 5’-GAGGTGAGCGCGGCTCGACT-3’; EXOC1
5’-GGGCGGACAGACGAGCTGAC-3’; EXOC2 5’-GGGCGGAAGTGAGGTGCCGG-3’;
non-targeting (GAL4) 5’-GTTGGAGCACTGTCCTCCGAACGT-3’.


Flow cytometry for surface staining and uptake measurements 

For protein uptake experiments in suspension cell lines, cells were 
incubated for the indicated time with proteins and conjugates, then 
washed 3 times with PBS at 4 °C. Cells were then incubated with either 
SYTOX Green or SYTOX Red according to the manufacturer’s specifi- 
cations for 15 min on ice. For protein uptake experiments in adherent 
cell lines, cells were incubated for the indicated time with proteins 
and conjugates, then washed two times quickly with PBS and lifted 
with trypsin. Cells were transferred to a 96-well V-bottom plate and 
washed 3 times with PBS, then incubated with either SYTOX Green or 
SYTOX Red according to the manufacturer’s specifications for 15 min 
on ice. For surface-staining experiments, adherent cells were lifted 
with non-enzymatic cell dissociation buffer (Gibco) and transferred 
to a 96-well V-bottom plate, and suspension cells were transferred to 
a 96-well V-bottom plate. After washing 3 times with 0.5% BSA in PBS, 
cells were incubated with primary antibody for 30 min on ice. Cells were
subsequently washed 3 times with 0.5% BSA in PBS, then incubated with
secondary antibodies for 30 min on ice. After incubation, cells were
washed 3 times with 0.5% BSA in PBS and incubated with either SYTOX
Green or SYTOX Red according to the manufacturer’s specifications
for 15 min on ice. Flow cytometry was performed on either a BD-Accuri
C6 Plus or DxP FACScan flow cytometer, and analysis was performed
using the FlowJo software package. Gating was performed on single
cells and live cells.


Cell surface degradation experiment 

Adherent cells (70% confluency for 24-h experiments, 50% confluency 
for 48-h experiments) were treated with antibodies or conjugates in 
complete growth medium. At the indicated time, cells were washed 
three times with Dulbecco’s phosphate-buffered saline (DPBS), then 
lysed as detailed below. Suspension cells were suspended in complete 
growth media in a flat bottom plate, and at the indicated time trans- 
ferred to a 96-well V-bottom plate and washed three times with PBS, 
then lysed as detailed in ‘Western blotting’. 


Competitive EGF stimulation experiment 

Adherent cells (70% confluency for 24-h experiments, 50% confluency 
for 48-h experiments) were treated with antibodies or conjugates in 
complete growth medium. At the indicated time, cells were washed 
three times with DPBS, then incubated with EGF in fresh complete 
growth medium. After incubation, cells were washed three times with 
DPBS, then lysed as detailed in ‘Western blotting’. 


Western blotting 

Adherent cells cultured in 24-well
plates were washed 3 times with DPBS, then lysed with RIPA buffer
supplemented with protease inhibitor cocktail (Roche), phosphatase
inhibitor cocktail (Cell Signaling Technologies) and 0.1% benzonase
(Millipore-Sigma) on ice for 30 min. The lysates were spun at 21,000g
for 15 min at 4 °C and protein concentration was determined using 
BCA assay (Pierce). Suspension cell lines were transferred to a 96-well 
V-bottom plate, washed 3 times with PBS, then lysed with RIPA buffer 
supplemented as above. Adherent cells in 6-well plates were washed 
three times with DPBS then lysed with 1X Laemmli buffer containing 
protease inhibitor cocktail (Roche) and phosphatase inhibitor cock- 
tail (Cell Signaling Technologies). Lysates were then sonicated on ice 
using a microtip sonicator for 30 s, with 10 s sonication intervals and 
rest intervals, and protein concentration was determined using BCA 
assay (Pierce). Equal amounts of lysates were separated by SDS-PAGE 
(4-12% Bis-Tris gel or 10% Bis-Tris gel), then transferred to a nitrocel- 
lulose membrane. After transfer, the membrane was stained for total 
protein using REVERT Total Protein Stain (LI-COR), then blocked in 
TBS-T with 5% non-fat dried milk or Odyssey Blocking Buffer (TBS) 
(LI-COR) for 1 h at room temperature with gentle shaking. Membranes
were incubated overnight with primary antibodies at 4 °C with gentle
shaking, then washed three times with TBS-T for five minutes each.
The membrane was then incubated with 800CW goat anti-mouse IgG,
800CW goat anti-rabbit IgG, 800CW donkey anti-goat IgG, 680LT goat
anti-rabbit IgG or 680LT goat anti-mouse IgG secondary antibodies
(1:10,000) in Odyssey Blocking Buffer (TBS) for 1 h at room temperature
with gentle shaking. Membranes were washed three times with TBS-T,
then imaged using an Odyssey CLx Imager (LI-COR). Quantification of
band intensities was performed using Image Studio Software (LI-COR). 
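
The figure legends report band intensities as per cent of control after normalization to a loading control. The exact formula is not stated in the text, so the Python sketch below shows one common way to compute such a value; the function name and the example intensities are hypothetical.

```python
def percent_of_control(target: float, loading: float,
                       control_target: float, control_loading: float) -> float:
    """Loading-normalized band intensity as a percentage of the control lane."""
    if loading <= 0 or control_loading <= 0 or control_target <= 0:
        raise ValueError("intensities used as denominators must be positive")
    normalized = target / loading                          # treated lane
    normalized_control = control_target / control_loading  # control lane
    return 100.0 * normalized / normalized_control


# Hypothetical band intensities (arbitrary units), for illustration only.
result = percent_of_control(target=250.0, loading=500.0,
                            control_target=800.0, control_loading=400.0)
print(result)
```

Dividing each target band by its own loading signal before comparing lanes removes differences in the amount of lysate loaded per lane.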


Confocal microscopy 

A Nikon A1R confocal microscope equipped with a Plan Fluor 60x oil 
immersion 1.30-numerical aperture objective was used. This instrument
is equipped with a 405-nm violet laser, a 488-nm blue laser, a
561-nm green laser and a 639-nm red laser. 


Mass spectrometry 
Sample preparation. Samples were prepared for proteomic analysis
using a previously described method with methanol precipitation to
extract proteins, followed by tryptic digestion. In brief, HeLa cells from
each treatment condition were lysed by suspension in 6 M guanidine,
total protein concentration was measured using BCA (Thermo Fisher
Scientific), and 100 µg of protein was extracted via precipitation in 90%
methanol for each sample. Supernatants were decanted and protein
pellets were suspended in a buffer comprising 8 M urea, 100 mM Tris
pH 8, 10 mM tris(2-carboxyethyl)phosphine and 40 mM chloroacetamide.
After incubation at room temperature for 30 min, samples were
diluted to 1.5 M urea with 100 mM Tris pH 8 and digested with trypsin
(50:1 protein to enzyme, Promega) overnight. Sample cleanup was
performed by first quenching with formic acid to a final pH of around
2, followed by desalting over a polystyrene-divinylbenzene solid phase
extraction (PS-DVB SPE) cartridge (Phenomenex). Samples were dried
with vacuum centrifugation after desalting, and peptide mass was
assayed using a peptide colorimetric assay (Fisher).


LC-MS/MS. All samples were resuspended in 0.2% formic acid in water
at around 500 ng µl⁻¹, and 500 ng of total peptide mass was injected
on column for each sample. Peptides were separated over a 25-cm
EasySpray reversed phase LC column (75 µm inner diameter packed
with 2 µm, 100 Å, PepMap C18 particles, Thermo Fisher Scientific).
The mobile phases (A: water with 0.2% formic acid and B: acetonitrile
with 0.2% formic acid) were driven and controlled by a Dionex Ultimate
3000 RPLC nano system (Thermo Fisher Scientific). Gradient elution
was performed at 300 nl min⁻¹. Mobile phase B was increased to 5%
over 6 min, followed by an increase to 40% at 76 min, a ramp to 90% B
at 77 min, and a wash at 90% B for 10 min. Flow was then ramped back
to 0% B over the course of 1 min, and the column was re-equilibrated
at 0% B for 12 min, for a total analysis of 100 min. Eluted peptides were
analysed on an Orbitrap Fusion Tribrid MS system (Thermo Fisher
Scientific). Precursors were ionized using an EASY-Spray ionization source
(Thermo Fisher Scientific) held at +2.2 kV compared to ground,
and the column was held at 40 °C. The inlet capillary temperature was
held at 275 °C. Survey scans of peptide precursors were collected in
the Orbitrap from 350-1,500 Th with an automatic gain control target
of 1,000,000, a maximum injection time of 50 ms and a resolution of
120,000 at 200 m/z. Monoisotopic precursor selection was enabled
for peptide isotopic distributions, precursors of z = 2-5 were selected
for data-dependent MS/MS scans for 2 s of cycle time, and dynamic
exclusion was set to 45 s with a ±10 ppm window set around the
precursor monoisotope. An isolation window of 0.7 m/z was used to select
precursor ions with the quadrupole. MS/MS scans were collected using
high-energy collisional dissociation at 30 normalized collision energy
(nce) with an automatic gain control target of 30,000 and a maximum
injection time of 25 ms. Mass analysis was performed in the linear ion
trap using the ‘Rapid’ scan speed while scanning from 200-1,500 m/z.
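
The gradient programme described above can be written as a list of segments whose durations sum to the stated 100-min run. The Python sketch below encodes those segments and interpolates the mobile phase B percentage at a given time. It assumes the run starts at 0% B and that each change is linear, neither of which is stated explicitly in the text; the names are illustrative.

```python
# (duration in min, %B at the end of the segment), in run order.
GRADIENT_SEGMENTS = [
    (6, 5),    # increase to 5% B over the first 6 min
    (70, 40),  # increase to 40% B, reached at 76 min
    (1, 90),   # ramp to 90% B, reached at 77 min
    (10, 90),  # wash at 90% B
    (1, 0),    # ramp back to 0% B
    (12, 0),   # re-equilibration at 0% B
]


def percent_b_at(time_min, segments=GRADIENT_SEGMENTS, start_b=0.0):
    """Linearly interpolate %B at a time point within the programme."""
    if time_min < 0:
        raise ValueError("time point must be non-negative")
    elapsed, previous_b = 0.0, start_b
    for duration, end_b in segments:
        if time_min <= elapsed + duration:
            fraction = (time_min - elapsed) / duration
            return previous_b + fraction * (end_b - previous_b)
        elapsed, previous_b = elapsed + duration, end_b
    raise ValueError("time point lies beyond the end of the programme")


total_min = sum(duration for duration, _ in GRADIENT_SEGMENTS)
print(total_min)           # 100
print(percent_b_at(41.0))  # midpoint of the 5% to 40% segment
```

Summing the segment durations is a quick consistency check on the programme: 6 + 70 + 1 + 10 + 1 + 12 gives the 100 min quoted for the total analysis.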


Data analysis. Raw data were processed using MaxQuant version
1.6.3.4, and tandem mass spectra were searched with the Andromeda
search algorithm. Oxidation of methionine and protein N-terminal
acetylation were specified as variable modifications, while
carbamidomethylation of cysteine was set as a fixed modification. A precursor
ion search tolerance of 20 ppm and a product ion mass tolerance of 0.3
Da were used for searches, and two missed cleavages were allowed for
full trypsin specificity. Peptide spectral matches were made against a
target-decoy human reference proteome database downloaded from
Uniprot. Peptides were filtered to a 1% FDR and a 1% protein FDR was
applied according to the target-decoy method. Proteins were identified
and quantified using at least one peptide (razor + unique), where
a razor peptide is defined as a non-unique peptide assigned to the protein
group with the most other peptides (Occam’s razor principle).
Proteins were quantified and normalized using MaxLFQ with a
label-free quantification (LFQ) minimum ratio count of 1. LFQ intensities
were calculated using the match between runs feature, and MS/MS
spectra were required for LFQ comparisons. For quantitative
comparisons, protein intensity values were log₂-transformed before
further analysis, and missing values were imputed from a normal
distribution with width 0.3 and downshift value of 1.8 (that is, default
values) using the Perseus software suite. Significance calculations
were performed using a two-tailed t-test with heteroscedastic variance
(performed in Microsoft Excel).
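
The imputation and significance steps can be illustrated with the Python standard library. This sketch is not the Perseus or Excel implementation. It assumes that "width" and "downshift" are expressed in units of the standard deviation of the observed values, and it stops at the Welch t statistic and degrees of freedom because converting these to a P value requires a t-distribution function that the standard library lacks. All numbers are hypothetical.

```python
import random
import statistics


def impute_missing(values, width=0.3, downshift=1.8, rng=None):
    """Fill None entries with draws from a narrowed, down-shifted normal.

    The replacement distribution is centred at mean - downshift * sd of the
    observed values and has standard deviation width * sd.
    """
    rng = rng or random.Random()
    observed = [v for v in values if v is not None]
    if len(observed) < 2:
        raise ValueError("need at least two observed values")
    mean = statistics.fmean(observed)
    sd = statistics.stdev(observed)
    return [v if v is not None else rng.gauss(mean - downshift * sd, width * sd)
            for v in values]


def welch_t(a, b):
    """Welch's t statistic and Welch-Satterthwaite degrees of freedom."""
    va = statistics.variance(a) / len(a)
    vb = statistics.variance(b) / len(b)
    t = (statistics.fmean(a) - statistics.fmean(b)) / (va + vb) ** 0.5
    df = (va + vb) ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    return t, df


# Hypothetical log2 LFQ intensities, for illustration only.
sample = impute_missing([24.1, 25.3, None, 26.0, 24.8], rng=random.Random(0))
t_stat, dof = welch_t([24.0, 24.5, 25.1], [26.2, 26.0, 26.9])
print(sample)
print(round(t_stat, 2), round(dof, 2))
```

Drawing replacements from the low end of the observed distribution reflects the assumption that values are missing mainly because the protein fell below the detection limit.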


Antibody in vivo clearance study 

Mouse experiments were performed in compliance with ethical regula- 
tions approved by the Administrative Panel on Laboratory Animal Care 
(APLAC) at Stanford University under protocol 31511. Female BALB/c 
mice (6-8 weeks old, from the Jackson Laboratory) were treated with 
5 mg kg⁻¹ of either antibody or conjugate via intraperitoneal injection
(3 mice per group). The initial grouping of mice was random, after which 
no additional randomization or blinding was used. At the indicated 
time, blood was sampled from the tail using anti-coagulant capillary 
tubes, and serum was separated after centrifugation at 700g at 4 °C for 
15 min. To analyse the presence of human antibody in serum, 2 µl serum
was diluted into 30 µl LDS loading buffer and separated by SDS-PAGE
(4-12% or 10% Bis-Tris gel), then transferred to a nitrocellulose membrane.
Liver and spleen were collected after 72 h, homogenized and
lysed with RIPA buffer supplemented with protease inhibitor cocktail
and 0.1% benzonase, and 50 µg of organ lysate was separated by SDS-PAGE
(4-12% Bis-Tris gel). The presence of cetuximab was detected
using 800CW goat anti-human IgG secondary antibody (1:1,000). 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


CRISPRi screen results are provided in Supplementary Table 1, and
quantitative proteomics results are provided in Supplementary Table 2.
The flow cytometry gating strategy is provided in the Supplementary
Information. All data that supported the findings of this study are 
included and are also available from the corresponding author upon 
request. Source data are provided with this paper. 
Extended Data Fig. 1 | Synthesis of M6Pn-NCA, poly(mannose-6-phosphate-co-Ala),
poly(mannose-co-Ala) and poly(GalNAc-co-Ala).
a, Synthetic route to mannose-6-phosphonate-serine N-carboxyanhydride
(NCA). b, Synthetic route to M6P-NCA, followed by Ni-catalysed
polymerization. Polymerization reactions were carried out in a N₂ glovebox for
Extended Data Fig. 2 | Biotin-poly(M6Pn) LYTACs direct NA-647 to
lysosomes. a, General scheme for biotinylation of glycopolypeptides with
sulfo-NHS biotin. Biotinylation reactions were performed in 1x PBS at room
temperature overnight. b, Biotin-LYTAC-mediated NA-647 uptake is
continuous over time in K562 cells. K562 cells were incubated at 37 °C in
complete growth medium with 500 nM NA-647 or 500 nM NA-647 and 2 µM
poly(M6Pn)_short for the indicated time, then washed and analysed by live-cell
flow cytometry. The MFI (mean fluorescence intensity) was measured relative
to background fluorescence from untreated K562 cells. c, d, Biotinylated
poly(M6Pn) LYTACs direct NA-647 to lysosomes in K562 cells (c) and Jurkat
cells (d). Cells were incubated with PBS, 500 nM NA-647, or 500 nM NA-647 and
2 µM biotinylated poly(M6Pn),,,,,; for 0.5-1 h in complete growth medium.
NA-647 (red) colocalized with acidic endosomes and lysosomes as labelled
with LysoTracker Green (turquoise). Scale bar, 20 µm. Fluorescence intensity is
normalized in the NA-647 channel for all images. For c, d, data are
representative of two independent experiments. For b, data are mean ± s.d. of
three independent experiments.
Extended Data Fig. 3 | EGFR surface levels are unchanged upon EXOC1 and
EXOC2 knockdown in HeLa cells. Cetuximab binds equally to dCas9-KRAB
HeLa cells transfected with non-targeting sgRNA and cells transfected with
sgRNA targeting IGF2R, EXOC1 or EXOC2, indicating no change in EGFR surface
levels. Cells were subjected to live-cell flow cytometry using cetuximab
followed by an Alexa Fluor-647-conjugated anti-human
(anti-human 647) secondary antibody. Data are representative of two
independent experiments.
Extended Data Fig. 4 | Ab-1 mediates uptake of soluble proteins to
lysosomes. a, Uptake of an Alexa Fluor-488 (AF488)-labelled mouse IgG
(m-IgG-488) into cells using antibody LYTACs. b, Uptake of m-IgG-488 using
Ab-1. Mean fluorescence intensity (MFI) fold change over background uptake
measured by live-cell flow cytometry. K562 cells were incubated at 37 °C for 1 h
with 50 nM IgG-488 or 50 nM IgG and 25 nM of anti-mouse or Ab-1. c, AF488
signal (green) colocalized with acidic endosomes and lysosomes as labelled
with LysoTracker Red (magenta). Expanded view shows a cell containing IgG-488
and LysoTracker Red. Scale bar, 20 µm. d, ApoE4-647 uptake over time.
K562 cells were incubated with 50 nM ApoE4-647 in the presence or absence of
25 nM anti-ApoE4 and Ab-1. At the indicated time point, cells were aliquoted
and median fold intensity (MFI) measurements were taken by live-cell flow
cytometry. e, Total protein levels for leupeptin inhibition of ApoE4 degradation
in K562 cells, corresponding to lanes shown in Fig. 3h. Total protein was
visualized by Coomassie stain. f, Flow cytometry plots of ApoE4-647 uptake
over time, with or without leupeptin inhibition. g, Uptake of ApoE4-647 to
lysosomes. K562 cells were incubated with PBS, 50 nM ApoE4-647, 25 nM
anti-ApoE4 and 25 nM Ab-1 for 1 h or 24 h in complete growth media at 37 °C. Alexa
Fluor-647 signal (red) colocalizes with acidic endosomes and lysosomes as
labelled with LysoTracker Green (turquoise). Data are representative of two
(c, e-g) independent experiments. Data are mean ± s.d. of three independent
experiments (b, d). P values were determined by unpaired two-tailed t-tests;
fold changes are reported relative to incubation with protein targets alone (b)
or background fluorescence (d).
Extended Data Fig. 5 | Optimization of LYTAC-mediated EGFR degradation.
a, Native gel of cetuximab (ctx)-based LYTACs. b, Levels of EGFR in HeLa cells
treated with 100 nM ctx (lane 3), ctx-GalNAc (lane 4), ctx-M6Pn_long (lane 5) or
ctx-M6Pn_short (lane 6) for 24 h in complete growth medium. EGF stimulation is a
positive control for EGFR downregulation. c, Synthesis of linker-swapped Ab-2.
Ctx was labelled with NHS-PEG,-N;, then incubated with BCN-functionalized
poly(M6Pn),,;: for 3 days at room temperature. Reaction progress was
monitored by native gel electrophoresis and visualized with Coomassie stain.
d, Native gel of ctx-Fab-based LYTACs. e, EGFR levels in dCas9-KRAB HeLa cells
transfected with non-targeting sgRNA against GAL4 after incubation with 100
nM, 10 nM, 1 nM, or 0.1 nM conjugates for 36 h in complete growth medium.
f, EGFR levels in dCas9-KRAB HeLa cells transfected with non-targeting GAL4
sgRNA incubated with Ab-2 or ctx for the indicated time. g, Quantification of
LYTAC or ctx-mediated EGFR degradation in dCas9-KRAB HeLa expressing
GAL4 sgRNA over time as read out by western blot relative to untreated cells.
h, Levels of pEGFR in dCas9-KRAB HeLa cells expressing an sgRNA targeting
IGF2R after 24 h incubation with 10 nM ctx or Ab-2, then incubation with EGF for
10 or 60 min. Data are representative of two (a-e, h) or three (f) independent
experiments. For g, data are mean ± s.d. of three independent experiments,
one of which is shown in f. Per cent control was calculated by densitometry and
normalized to loading control (b, e, f).
Extended Data Fig. 6 | Mixed-cell assay demonstrates that binding
specificity is comparable between ctx-M6Pn and ctx. a, Scheme for mixed
cell assay. HeLa cells were lifted and labelled with CellTracker Deep Red, then
mixed in a 1:1 ratio with Jurkat cells. The mixed cell sample was stained with
either 10 nM ctx or ctx-M6Pn conjugate, followed by anti-human 488, then
subjected to live-cell flow cytometry. b, Cell surface CI-M6PR levels on HeLa
cells (CIM6PR⁺EGFR⁺) and Jurkat cells (CIM6PR⁺EGFR⁻) were measured by
live-cell flow cytometry. HeLa and Jurkat cells exhibited similar levels of
cell-surface CI-M6PR. c, Ctx and ctx-M6Pn exhibit equivalent binding to HeLa
cells, and ctx-M6Pn exhibits minimal increased binding to Jurkat cells relative
to ctx. Data are representative of two independent experiments (b) or two
experimental replicates (c).
Extended Data Fig. 7 | LYTACs mediate EGFR degradation in multiple cell
lines. a, EGFR levels in BT-474, MDA-MB-361, or HepG2 cells after incubation
with 10-20 nM conjugates. b, Proliferation of HepG2 cells in the presence of
EGF (200 ng ml⁻¹) and 50 nM cetuximab or Ab-2. Cells were incubated with EGF
and antibodies for 48 h, and proliferation measured using an MTS assay. Data
are representative of three independent experiments (a). For b, data are
mean ± s.e.m. of three independent experiments. P values were determined by
unpaired two-tailed t-tests. Per cent control was calculated by densitometry
and normalized to loading control.
Extended Data Fig. 8 | Synthesis of anti-PD-L1 glycopolypeptide
conjugates, PD-L1 degradation, and CD71 degradation depends on M6P
binding. a, Anti-PD-L1 was non-specifically labelled with BCN, then incubated
with poly(M6Pn)_short for 3 days at room temperature. Reaction progress was
monitored by native gel electrophoresis and visualized by Coomassie stain.
b, Cell-surface PD-L1 determined by live-cell flow cytometry after incubation
with anti-PD-L1 or conjugates (50 nM). At each time point, cells were washed,
lifted, brought to 4 °C, then stained for PD-L1 using excess unconjugated
anti-PD-L1 (1 µM). c, PD-L1 levels in MDA-MB-231 cells after 48-h incubation with
anti-PD-L1 or Ab-3. d, PD-L1 levels in HDLM-2 cells after 36 h incubation with
anti-PD-L1 or Ab-3. e, Quantification of PD-L1 degradation in HDLM-2 cells with
Ab-3. f, Atezolizumab was non-specifically labelled with NHS-(PEG),-N;, then
incubated with poly(M6Pn),,,.,,-BCN for 3 days at room temperature. Reaction
progress was monitored by native gel electrophoresis and visualized by
Coomassie stain. g, Levels of CD71 in Jurkat cells after 24 h in the presence of 5
mM M6P. Data are representative of two (a, c, f, g) independent experiments.
For b, data are mean ± s.d. of three independent experiments, and cell surface
levels are relative to untreated cells. For e, data are mean ± s.d. of three
independent experiments, one of which is shown in d. Per cent control was
calculated by densitometry and normalized to loading control.
Extended Data Table 1 | Selected examples of proteins with differential abundance after treatment of HeLa cells with
ctx or Ab-2

Ab-2 (ctx-M6Pn)

Downregulated
Protein   Fold change   P value
DHPR      -3.08         0.023
TFDP1     -2.98         0.039
8ODP      -1.82         0.019
CD99      -1.8          0.051
DANG      acs           ae
WDR11     -1.38         0.049
BAG5      “1°           012
AGAP3     -1.31         0.036
SCO2      iach          oer
ORC2      -1.08         0.015

Upregulated
Protein   Fold change   P value
CCD57     3.82          0.046
PGM2L     2.72          0.012
NFIB      2.48          0.003
ALKB5     2.21          0.029
COPRS     2.13          5.4E-05
IGKC      1.99          0.009
FRYL      1.99          0.024
DESP      1.12          0.0006

Cetuximab

Downregulated
Protein   Fold change   P value
TF2B      -2.37         0.008
HIF1N     -1.54         0.042

Upregulated
Protein   Fold change   P value
CCD57     3.82          0.036
IGKC      3.15          0.032
NFIB      2.43          0.006
PO2F1     2.06          0.03
NRBP      2.01          0.011


Untreated, ctx-treated, or Ab-2-treated HeLa cells were lysed, digested, and analysed by quantitative proteomics. Depicted are exemplary hits with fold change values reported relative to 
untreated samples. Proteins highlighted in grey were found in both ctx and Ab-2 treated cells. For a full list, see Supplementary Information. Data are mean of three biological replicates. 
P values were determined by unpaired two-tailed t-tests with untreated cells. 
Statistics


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection BD CSampler Plus or FACS Diva (v 8.0) were used for flow cytometry data acquisition. Wyatt ASTRA software package (v 7.0.1.24) was 
used for GPC data acquisition. Image Studio Software (v 5.2) was used for Western blot scanning. NIS Elements (v 5.11.00) was used for 
fluorescence microscopy image acquisition. Excalibur (v 4.2) was used for MS data acquisition. 


Data analysis MaxQuant (1.6.3.4) and the Andromeda search algorithm along with Perseus (1.6.22) were used for MS data analysis. MestReNova (v 
12.0.3) was used for all chemical NMR analysis. The Wyatt ASTRA software package (v 7.0.1.24) was used for analysis of aqueous GPC 
data. Microsoft Excel (v 16.23) and Graphpad Prism (v 5.0) were used for data analysis. FlowJo (v 10.0) was used for flow cytometry data 
analysis. ImageJ (v 1.0) was used for fluorescence microscopy image analysis. Image Studio Software (v 5.2) was used for Western blot 
quantification. 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability 


All data relevant to the manuscript are included, including full data sets from CRISPRi screens and proteomics and source data. Further details and information are 
available from the corresponding author upon request. 




Field-specific reporting 


Please select the one below that is the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 


Life sciences Behavioural & social sciences Ecological, evolutionary & environmental sciences 


For a reference copy of the document with all sections, see nature.com/documents/nr-reporting-summary-flat.pdf


Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size No statistical method was used to determine sample size. Sample size choice was based on our experience with variability in these assays and 
our ability to detect meaningful differences between treatments. Variability was low based on our experience with western blot analysis and 
flow cytometry quantification experiments of protein uptake, degradation experiments, and surface measurement experiments. The sample 
size for mouse experiments was chosen based on previous experience with respect to the observed variability differences among groups 
when using western blot analysis of mouse serum to detect meaningful differences between treatments. 
Data exclusions There were no data exclusions except those resulting from technical errors making data interpretation impossible (torn gels, blots, poor gel
transfers). 


Replication All data were successfully replicated with multiple biological replicates as indicated in the Figure legends. Experiments in Extended Data Fig.
6c were performed once with two experimental replicates. After two passages post-selection of dCas9-CRISPRi with EXOC1 sgRNA, K562 cells
lost the EXOC1 phenotype. Data in Fig. 2 were obtained within one passage of one another.


Randomization Initial choice of mouse for injection of antibody or conjugate was random. The order of analysis in flow cytometry experiments was
randomized. No further formal randomization was employed as internal controls were employed for quantitative comparisons. 


Blinding Sample treatments and subsequent MS sample processing and MS analysis were blinded from one another. No additional blinding was 
employed for western blot analysis or flow cytometry analysis in order to make comparisons between specific treatments. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems
n/a | Involved in the study
Antibodies
Eukaryotic cell lines
Palaeontology
Animals and other organisms
Human research participants
Clinical data

Methods
n/a | Involved in the study
ChIP-seq
Flow cytometry
MRI-based neuroimaging


Antibodies 


Antibodies used Atezolizumab (A1305, Biovision, lot 4K23A13050, usage varied)
Rabbit anti-pAKT (S473) (9271, Cell Signaling Technologies, lot 14, 1:1000)
Rabbit anti-pEGFR (Y1068) (2234, Cell Signaling Technologies, lot 21, 1:1000)
Rabbit anti-p44/42 MAPK (4376, Clone 20G11, lot 18, Cell Signaling Technologies, 1:1000)
Rabbit anti-EGFR (4267, Clone D38B1, lot 19, Cell Signaling Technologies, 1:1000)
Goat anti-EGFR (AF231, R&D Systems, lot AUC1217071, 1 µg/mL)
Mouse anti-sCD71 (VMA00037, Bio-Rad, lot 09071, 1:1000)
Rabbit anti-PD-L1 (13684, Clone E1L3N, Cell Signaling Technologies, lot 7, 1:500)
Mouse anti-Vinculin (MCA465GA, Clone V284, Bio-Rad, lot 1806, 1:1000)
Mouse anti-alpha-Tubulin (T5168, Clone B-5-1-2, Millipore Sigma, 1:5000)
Rabbit anti-EXOC1 (ab118798, Abcam, 1:2500)
Rabbit anti-CIM6PR (14364, Clone D3V8C, Cell Signaling Technologies, lot 1, 1:1000)
Mouse anti-CD71 (BE0023, Clone OKT-9, BioXCell, usage varied)
Mouse anti-PD-L1 (BE0285, Clone 29E.2A3, BioXCell, usage varied)
Mouse anti-mCherry (orb66657, Clone 5A12A7H10, Biorbyt, usage varied)
Mouse anti-ApoE4 (NBP1-49529, Clone 4E4, Novus Biologicals, lot E-2, usage varied)
Goat anti-Mouse IgG (115-005-062, Jackson ImmunoResearch, usage varied)
Mouse anti-biotin-Alexa Fluor 488 (200-542-211, Jackson ImmunoResearch, lot 119967, usage varied)
Cetuximab, obtained from Eli Lilly (usage varied, 0.1 µM when used for flow cytometry staining, batch IMF406)
Mouse anti-CIM6PR (ab2733, Clone 2G11, Abcam, lot GR310738-6, 1:500)
Goat anti-mouse IgG-Alexa Fluor 488 (115-545-008, Jackson ImmunoResearch, 1:375)
Goat anti-mouse IgG-Alexa Fluor 647 (115-605-071, Jackson ImmunoResearch, 1:375)
Goat anti-human IgG-Alexa Fluor 647 (109-605-003, Jackson ImmunoResearch, 1:375)
Goat anti-Rabbit IgG 800CW (926-32211, LI-COR, 1:10000)
Goat anti-Mouse IgG 800CW (926-32210, LI-COR, 1:10000)
Donkey anti-Goat IgG 800CW (926-32214, LI-COR, 1:10000)
Goat anti-Rabbit IgG 680LT (926-68021, LI-COR, 1:10000)
Goat anti-Mouse IgG 680LT (926-68020, LI-COR, 1:10000)
Goat anti-Human IgG 800CW (926-32232, LI-COR, 1:1000)


Validation All antibodies were used for applications validated by antibody suppliers per quality assurance provided by each supplier. 


Atezolizumab (A1305, Biovision): https://www.biovision.com/anti-pd-l1-atezolizumab-humanized-antibody.html. Tecentriq is the 
United States FDA-approved antibody against PD-L1. Binding to PD-L1 verified by knockdown measured by blot with E1L3N (Cell 
Signaling Technologies) in Fig.4. 


anti-pAKT (S473) (9271, Cell Signaling Technologies): https://www.cellsignal.com/products/primary-antibodies/phospho-akt- 
ser473-antibody/9271. Validated from manufacturer's website and citations therein, validation included stimulation with AKT 
activating ligands (e.g. PDGF) as well as transfection with phospho-inactive AKT mutants.


anti-pEGFR (Y1068) (2234, Cell Signaling Technologies): https://www.cellsignal.com/products/primary-antibodies/phospho-egf- 
receptor-tyr1068-antibody/2234. Validated from manufacturer's website and citations therein, validation included stimulation 
with EGF. 


anti-p44/42 MAPK (4376, Clone 20G11, Cell Signaling Technologies): https://www.cellsignal.com/products/primary-antibodies/ 
phospho-p44-42-mapk-erk1-2-thr202-tyr204-20g11-rabbit-mab/4376. Validated from manufacturer's website and citations 
therein. Validation included comparison with p38 and protein levels in the presence and absence of MAPK activating ligands. 


anti-EGFR (4267, Clone D38B1, Cell Signaling Technologies): https://www.cellsignal.com/products/primary-antibodies/egf- 
receptor-d38b1-xp-rabbit-mab/4267. Validated from manufacturer's website and citations therein, validation included EGFR 
knock-out. 


anti-EGFR (AF231, R&D Systems): https://www.rndsystems.com/products/human-egfr-antibody_af231#product-details. 
Validated from manufacturer's website and citations therein, results cross-validated with anti-EGFR D38B1 (Cell Signaling 
Technologies) by comparison with Extended Data Fig. 5. 




anti-sCD71 (VMA00037, Bio-Rad): https://www.bio-rad-antibodies.com/monoclonal/human-cd71-soluble-antibody- 
vma00037.html?f=antibody%20only&thirdPartyCookieEnabled=false. Validated from manufacturer's website. Manufacturer's 
statement: "The PrecisionAb label is reserved for antibodies that meet the defined performance criteria within Bio-Rad's ongoing 
antibody validation programme." 




anti-PD-L1 (13684, Clone E1L3N, Cell Signaling Technologies):
https://www.cellsignal.com/products/primary-antibodies/pd-l1-e1l3n-xp-rabbit-mab/13684. Validated from manufacturer's
website and citations therein, validation included demonstration of PD-L1 increases in the presence or absence of IFN-γ.


Eukaryotic cell lines


anti-Vinculin (MCA465GA, Clone V284, Bio-Rad):
https://www.bio-rad-antibodies.com/monoclonal/human-vinculin-antibody-v284-mca465.html?f=purified&thirdPartyCookieEnabled=false.
Validated from manufacturer's website and citations therein.


anti-alpha-Tubulin (T5168, Clone B-5-1-2, Millipore Sigma): https://www.sigmaaldrich.com/catalog/product/sigma/t5168? 
lang=en&region=US. Validated from manufacturer's website and citations therein. 


anti-EXOC1 (ab118798, Abcam): https://www.abcam.com/exoc1-antibody-ab118798.html. Validated from manufacturer's 
website and citations therein, validated with CRISPRi knockdown in Fig. 2 of this work. 


anti-CIM6PR (14364, Clone D3V8C, Cell Signaling Technologies): https://www.cellsignal.com/products/primary-antibodies/igf-ii- 
receptor-ci-m6pr-d3v8c-rabbit-mab/14364. Validated from manufacturer's website and citations therein, validated with CRISPRi 
knockdown in Fig. 2 of this work. 


anti-CD71 (BE0023, Clone OKT-9, BioXCell): https://bxcell.com/product/h-cd71/. Validated from manufacturer's website and
citations therein. 


anti-PD-L1 (BE0285, Clone 29E.2A3, BioXCell): https://bxcell.com/product/invivomab-anti-human-pd-l1-b7-h1/. Validated from
manufacturer's website and citations therein. 


anti-mCherry (orb66657, Clone 5A12A7H10, Biorbyt): https://www.biorbyt.com/mcherry-antibody-orb66657.html. Validated 
from manufacturer's website and citations therein. Validated by uptake of recombinant mCherry in Fig. 3. 


anti-ApoE4 (NBP1-49529, Clone 4E4, Novus Biologicals): https://www.novusbio.com/products/apoe4- 
antibody-4e4_nbp1-49529. Validated from manufacturer's website and citations therein. 


goat anti-Mouse IgG (115-005-062, Jackson ImmunoResearch): https://www.jacksonimmuno.com/catalog/ 
products/115-005-062. Validated from manufacturer's website and citations therein. 


mouse anti-biotin-Alexa Fluor 488 (200-542-211, Jackson ImmunoResearch): https://www.jacksonimmuno.com/catalog/ 
products/200-542-211. Validated from manufacturer's website and citations therein, validated by investigators performing 
uptake experiments with anti-mouse IgG (Extended Data Fig. 4). 


Cetuximab (anti-EGFR) is a United States FDA-approved antibody against EGFR.


anti-CIM6PR (ab2733, Clone 2G11, Abcam): https://www.abcam.com/m6pr-cation-independent-antibody-2g11-ab2733.html. 
Validated from manufacturer's website and citations therein, validated in Fig. 2 by CRISPRi knockdown. 


Goat anti-mouse IgG-Alexa Fluor 488 (115-545-008, Jackson ImmunoResearch): https://www.jacksonimmuno.com/catalog/ 
products/115-545-008. Validated from manufacturer's website and citations therein. 


Goat anti-mouse IgG-Alexa Fluor 647 (115-605-071, Jackson ImmunoResearch): https://www.jacksonimmuno.com/catalog/ 
products/115-605-071. Validated from manufacturer's website and citations therein. 


Goat anti-human IgG-Alexa Fluor 647 (109-605-003, Jackson ImmunoResearch): https://www.jacksonimmuno.com/catalog/ 
products/109-605-003. Validated from manufacturer's website and citations therein. 


Goat anti-Rabbit IgG 800CW (926-32211, LI-COR): https://www.licor.com/bio/reagents/irdye-800cw-goat-anti-rabbit-igg- 
secondary-antibody. Validated from manufacturer's website. 


Goat anti-Mouse IgG 800CW (926-32210, LI-COR): https://www.licor.com/bio/reagents/irdye-800cw-goat-anti-mouse-igg- 
secondary-antibody. Validated from manufacturer's website. 


Donkey anti-Goat IgG 800CW (926-32214, LI-COR, 1:10000): https://www.licor.com/bio/reagents/irdye-800cw-donkey-anti-goat- 
igg-secondary-antibody. Validated from manufacturer's website. 


Goat anti-Rabbit IgG 680LT (926-68021, LI-COR, 1:10000):
https://www.licor.com/bio/reagents/irdye-680lt-goat-anti-rabbit-igg-secondary-antibody. Validated from manufacturer's website.


Goat anti-Mouse IgG 680LT (926-68020, LI-COR, 1:10000):
https://www.licor.com/bio/reagents/irdye-680lt-goat-anti-mouse-igg-secondary-antibody. Validated from manufacturer's website.


Goat anti-Human IgG 800CW (926-32232, LI-COR, 1:1000): https://www.licor.com/bio/reagents/irdye-800cw-goat-anti-human- 
igg-secondary-antibody. Validated from manufacturer's website. 


Policy information about cell lines 


Cell line source(s) 


Jurkat, HeLa, MDA-MB-361, MDA-MB-231, THP-1, K562, HEK293, Hep3B, HepG2 and BT474 were obtained from the
American Type Culture Collection (ATCC); HDLM-2 cells were obtained from the laboratory of Dr. Thomas Waldmann (NIH);
CRISPRi cell lines were obtained from Dr. Michael Bassik's laboratory (Stanford University).
Authentication Cell lines were authenticated by the supplier; HDLM-2 and CRISPRi lines were not subjected to additional authentication.
Mycoplasma contamination Cell lines were verified to be mycoplasma negative by Lonza MycoAlert Mycoplasma Detection Assay. 


Commonly misidentified lines No commonly misidentified lines were used. 
(See ICLAC register) 


Animals and other organisms 
Laboratory animals BALB/c mice, female, 6-8 weeks old

Wild animals No wild animals were involved in this study. 
Field-collected samples This study did not involve collecting samples from the field. 
Ethics oversight IACUC and Stanford APLAC approval# 31511 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 


Flow Cytometry 


Plots 


Confirm that: 


The axis labels state the marker and fluorochrome used (e.g. CD4-FITC). 


The axis scales are clearly visible. Include numbers along axes only for bottom left plot of group (a 'group' is an analysis of identical markers). 


All plots are contour plots with outliers or pseudocolor plots. 


A numerical value for number of cells or percentage (with statistics) is provided. 


Methodology 


Sample preparation For protein uptake experiments in suspension cell lines, cells were incubated for the indicated time with proteins and 
conjugates, then washed three times with PBS at 4 °C. Cells were then incubated with either Sytox Green or Sytox Red according 
to the manufacturer’s specifications for 15 minutes on ice. For protein uptake experiments in adherent cell lines, cells were 
incubated for the indicated time with proteins and conjugates, then washed two times quickly with PBS and lifted with trypsin. 
Cells were transferred to a 96-well V-bottom plate and washed three times with PBS, then incubated with either Sytox Green or 
Sytox Red according to the manufacturer’s specifications for 15 minutes on ice. For surface staining experiments, adherent cells 
were lifted with non-enzymatic cell dissociation buffer (Gibco) and transferred to a 96-well V-bottom plate, and suspension cells 
were transferred to a 96-well V-bottom plate. After washing three times with 0.5% BSA in PBS, cells were incubated with primary 
antibody for 30 minutes on ice. Cells were subsequently washed three times with 0.5% BSA in PBS, then incubated with 
secondary antibodies for 30 minutes on ice. After incubation, cells were washed three times with 0.5% BSA in PBS and incubated 
with either Sytox Green or Sytox Red according to the manufacturer’s specifications for 15 minutes on ice. 


Instrument BD-Accuri C6 Plus and DxP FACScan for analysis, BD Aria II for sorting
Software BD CSampler Plus or FACS Diva was used for data collection; data analysis was performed using the FlowJo software package.
Cell population abundance N/A 


Gating strategy Gating was based on FSC and SSC area, followed by dead cell removal using a Sytox stain, and doublet removal using FSC-H vs 
FSC-A. 
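
The gating order described above (scatter gate, then dead-cell exclusion, then doublet removal) can be sketched as three boolean masks combined in sequence. This is only an illustrative sketch on plain arrays: the function name, the threshold values and the FSC-H/FSC-A ratio window are hypothetical placeholders, not the gates drawn in FlowJo for this study.

```python
import numpy as np

def gate_events(fsc_a, ssc_a, fsc_h, sytox,
                scatter_min=(2e4, 1e3), sytox_max=1e3, ratio_window=(0.85, 1.15)):
    """Return a boolean mask of events passing all three gates.

    All thresholds are hypothetical and would be set per instrument and sample.
    """
    fsc_a = np.asarray(fsc_a, dtype=float)
    ssc_a = np.asarray(ssc_a, dtype=float)
    fsc_h = np.asarray(fsc_h, dtype=float)
    sytox = np.asarray(sytox, dtype=float)

    # Gate 1: keep cell-sized events by FSC/SSC area (drops debris).
    cells = (fsc_a >= scatter_min[0]) & (ssc_a >= scatter_min[1])

    # Gate 2: dead cells take up the Sytox dye, so keep only dim events.
    live = sytox <= sytox_max

    # Gate 3: singlets have FSC-H roughly proportional to FSC-A;
    # doublets show excess area for their height.
    ratio = np.divide(fsc_h, fsc_a, out=np.zeros_like(fsc_a), where=fsc_a > 0)
    singlets = (ratio >= ratio_window[0]) & (ratio <= ratio_window[1])

    return cells & live & singlets
```

Because each gate is a separate mask, the number of events surviving each step can be reported by summing the intermediate masks.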


Tick this box to confirm that a figure exemplifying the gating strategy is provided in the Supplementary Information. 
Published online: 2020

Metabotropic γ-aminobutyric acid receptors (GABAB) are involved in the modulation
of synaptic responses in the central nervous system and have been implicated in
neuropsychological conditions that range from addiction to psychosis. GABAB
belongs to class C of the G-protein-coupled receptors, and its functional entity
comprises an obligate heterodimer that is composed of the GB1 and GB2 subunits.
Each subunit possesses an extracellular Venus flytrap domain, which is connected to a
canonical seven-transmembrane domain. Here we present four cryo-electron
microscopy structures of the human full-length GB1-GB2 heterodimer: one structure
of its inactive apo state, two intermediate agonist-bound forms and an active form in
which the heterodimer is bound to an agonist and a positive allosteric modulator. The
structures reveal substantial differences, which shed light on the complex motions
that underlie the unique activation mechanism of GABAB. Our results show that
agonist binding leads to the closure of the Venus flytrap domain of GB1, triggering a
series of transitions, first rearranging and bringing the two transmembrane domains
into close contact along transmembrane helix 6 and ultimately inducing
conformational rearrangements in the GB2 transmembrane domain via a lever-like
mechanism to initiate downstream signalling. This active state is stabilized by a
positive allosteric modulator binding at the transmembrane dimerization interface.


γ-Aminobutyric acid (GABA) is the principal inhibitory neurotransmitter in the central nervous
system, counteracting the main excitatory neurotransmitter glutamate and exerting its function via
two types of receptor: ionotropic GABAA receptors, which mediate fast responses, and metabotropic
G-protein-coupled GABAB receptors, which elicit slow, prolonged activity, primarily through Gi/o
proteins. GABAB functions both pre-synaptically to inhibit the release of neurotransmitters, and
post-synaptically, which leads to hyperpolarization of the neuron. Given its central role in
neurobiology, GABAB has previously been linked to various neurological diseases, pain regulation
and addiction and represents a major pharmacological target. Therapeutic drugs that target GABAB,
such as baclofen (sold under the brand name Lioresal) and β-phenyl-γ-aminobutyric acid (also
known as phenibut), have been used to treat spasticity, alcohol addiction, anxiety and insomnia.
Auto-antibodies that target GABAB have previously been identified at the origin of epilepsies and
encephalitis, and mutations in GABBR2 (which encodes GB2) have previously been associated with
Rett syndrome and epileptic encephalopathies.

GABAB belongs to class C of the G-protein-coupled receptors (GPCRs), along with calcium sensing
(CaS), metabotropic glutamate (mGlu) and taste 1 (TAS1) receptors. As opposed to mGlu and CaS
receptors (which function in homodimeric forms), the association of two distinct subunits, GB1 and
GB2, is required for GABAB function, although the formation of higher-order GABAB oligomers has
also previously been proposed. The heterodimeric assembly is stabilized by the interaction of the
two subunits through an intracellular coiled-coil domain, which masks the di-leucine
internalization (EKSRLL) and endoplasmic reticulum retention (RSRR) signals of GB1 and thus
allows the trafficking of the heterodimer to the cell surface. GABAB heterodimers use a unique
allosteric mechanism for signal transduction, in which the binding of an agonist in the
extracellular Venus flytrap domain (VFT) of GB1 leads to G protein activation through a
rearrangement of the intracellular interface of the transmembrane domain (TMD) of GB2. The
structural basis of this allosteric mechanism has previously been studied by X-ray crystallography
of soluble VFT heterodimers, which has revealed molecular rearrangements within the VFTs upon
ligand binding. Although this information is instrumental for our understanding of GABAB signal
transduction, it lacks key insights into the mechanism of TMD transitions associated with receptor
activation.

Here we used single-particle cryo-electron microscopy (cryo-EM)
to obtain full-length GABAB structures in four distinct conformations
(Fig. 1) along the receptor activation path from an inactive apo state to
the active state, stabilized by both an agonist and a positive allosteric
modulator (PAM). These structures reveal how the agonist-induced
transformations at the level of the VFT heterodimer lead to conformational
changes of the TMDs to trigger intracellular signalling.


1Bridge Institute, USC Michelson Center for Convergent Biosciences, University of Southern California, Los Angeles, CA, USA. 2Department of Chemistry, University of Southern California, Los
Angeles, CA, USA. 3Department of Biological Sciences, University of Southern California, Los Angeles, CA, USA. 4Institut de Génomique Fonctionnelle (IGF), CNRS, INSERM, Université de
Montpellier, Montpellier, France. 5Biosciences Division, SLAC National Accelerator Laboratory, Menlo Park, CA, USA. 6Department of Structural Biology, Stanford University, Stanford, CA, USA.
Fig. 1 | Cryo-EM maps and models of the GABAB heterodimer. a, b, Maps (a)
and models (b) are shown for four different conformations of GABAB, including
active and inactive states and two intermediate conformations. The numbers
in parentheses indicate the estimated resolution of the cryo-EM maps. GB1 and
GB2 are coloured in blue and yellow, respectively. The agonist (SKF97541)- and
PAM (GS39783)-bound state is labelled as active, with a caveat that the fully
active state can only be observed in complex with a G protein or other
transducer. Panel labels: Inactive (4 Å), apo; Int-1 (6.3 Å), SKF97541;
Int-2 (4.8 Å), SKF97541; Active (3.6 Å), SKF97541 and GS39783.


Determining the cryo-EM structure


After establishing expression and purification protocols using the
co-expression of both GB1 and GB2 subunits in insect cells (Extended
Data Fig. 1), we collected an initial dataset of the GB1-GB2 heterodimer
in digitonin micelles in the presence of 3-aminopropyl(methyl)phosphinic
acid (SKF97541), a GABAB agonist that is ten times more
potent than baclofen. SKF97541 has previously been shown to penetrate
the central nervous system, and has demonstrated antidepressant
activity in vivo. We verified that both ligand binding and the
signalling response of the GABAB construct we used for determining
the cryo-EM structure are identical to those of the wild-type receptor
(Fig. 2e, f). Our cryo-EM data revealed two substantially different
conformations of GABAB, one at a resolution of 6.3 Å and the other
at 4.8 Å (Fig. 1, Extended Data Figs. 2, 3, Extended Data Table 1). The
first reconstruction showed poorly resolved TMDs, whereas the VFTs
adopted a conformation that was intermediate between the previously
described apo and agonist-bound crystal structures. The second
reconstruction showed an active-like form, in which the two TMDs were
in contact with each other, and the VFTs matched the agonist-bound
crystal structure (Protein Data Bank code (PDB) 4MS3) within a root
mean square deviation (r.m.s.d.) of 2.1 Å. We attributed the limited
resolution of these structures to the fact that the agonist-bound receptor
was in a dynamic equilibrium between the two states, which we
dubbed intermediate state 1 and 2 (int-1 and int-2), and could not be
locked in a fully active conformation without a G protein or another
allosteric partner.

In an attempt to stabilize the receptor in its active state, we collected a
second dataset of GABAB in the presence of both the agonist SKF97541 and
the PAM GS39783, which has previously demonstrated anxiolytic activity
without the side effects that are associated with baclofen. Indeed,
we observed a single receptor conformation that achieved an overall
resolution of 3.6 Å (Figs. 1, 3a, Extended Data Figs. 3, 4, Extended Data
Table 1), enabling confident modelling of most amino acid side chains.

Finally, we collected a third dataset of the ligand-free (apo) form of
the receptor at 4 Å resolution. This structure differs markedly from all
of the ligand-bound states we reconstructed, and represents the starting
point of the activation pathway (Figs. 1, 2a, Extended Data Figs. 3,
5, Extended Data Table 1).
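
The r.m.s.d. values quoted in this section compare two sets of matched atomic coordinates after superposition, r.m.s.d. = sqrt((1/N) Σ |R·p_i − q_i|²), minimized over rigid-body rotations R once both sets are centred. The text does not say which software or atom selection was used, so the following is only a minimal sketch of the textbook Kabsch procedure, assuming two N x 3 arrays of already-paired atoms (for example Cα atoms); the function name is hypothetical.

```python
import numpy as np

def kabsch_rmsd(P, Q):
    """r.m.s.d. between paired N x 3 coordinate sets after optimal superposition."""
    P = np.asarray(P, dtype=float)
    Q = np.asarray(Q, dtype=float)
    if P.shape != Q.shape or P.ndim != 2 or P.shape[1] != 3:
        raise ValueError("P and Q must both have shape (N, 3)")

    # Remove the translational component by centring on the centroids.
    Pc = P - P.mean(axis=0)
    Qc = Q - Q.mean(axis=0)

    # Kabsch: the SVD of the 3 x 3 covariance matrix yields the best rotation.
    H = Pc.T @ Qc
    U, _, Vt = np.linalg.svd(H)

    # Flip one axis if needed so that R is a proper rotation, not a reflection.
    d = 1.0 if np.linalg.det(Vt.T @ U.T) >= 0 else -1.0
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T

    diff = (R @ Pc.T).T - Qc
    return float(np.sqrt((diff ** 2).sum() / len(P)))
```

The result is in the units of the input coordinates, so coordinates in Å give an r.m.s.d. in Å.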


Overall structure of GABAB heterodimer


The overall GABAB structure reveals a heterodimeric arrangement of the
two subunits (Figs. 1, 2a), in which each subunit contains an extracellular
VFT attached to the canonical TMD via a stalk domain. Both VFTs consist
of two lobes (which we refer to as LB1 and LB2) connected by a hinge
that allows the sampling of a range of conformations, from fully open
to fully closed. The heterodimeric association of the VFTs is mediated
by the two LB1s interacting with each other in a side-by-side orientation,
facing opposite directions. The structure of the stalk is distinct from the
cysteine-rich domain that connects the VFT and TMD domains in other
class-C GPCRs. The stalk consists of a twisted three-stranded β-sheet
that bundles the linker between the VFT and TMD with the long
extracellular loop 2 (ECL2) of the TMD (Fig. 2c). This secondary structure
element ensures a strong pairing between the two main domains, by
directly connecting the VFT with transmembrane helices TM1, TM4 and
TM5, as well as with TM3 through a conserved disulfide bond between ECL2
and TM3 (residue superscripts, where given, refer to generic residue
numbering in class-C GPCRs).
The junction sites between the stalks and the VFTs in both subunits are
strengthened by ionic interactions between positively and negatively
charged residues, which were sustained throughout our molecular dynamics
simulations, despite fluctuations of the VFTs (Fig. 2c, d). Bending and
twisting modes of the VFTs were observed in the inactive and active
states, respectively, and there were stronger ionic contacts and smaller
amplitudes of motion in the active state (Extended Data Fig. 6a, b, d, e).


Effects of agonist binding 

Our apo structure at 4 Å resolution allowed us to model the majority
of side chains and detailed arrangements within the main structural
domains of GB1-GB2 (Fig. 2a). We observed that in the apo state the
two GABAB subunits interact through their LB1s and the intracellular
tips of TM3 and TM5 (Extended Data Figs. 5i, 6c). Consistent with the
crystal structure of the soluble heterodimeric apo VFT (PDB 4MQE)
(r.m.s.d. = 1.3 Å) (Extended Data Fig. 7a), both VFTs are in a fully
open conformation with a maximal separation of 47.6 Å between the
C-terminal ends of the LB2s.

Fig. 2 | Structural details of GABAB in the inactive apo state. a, Overall view of
the apo GABAB model and map. b, Extracellular view of the GABAB TMD in the
inactive apo state. c, Structure of the stalk domains. The junctions between the
stalks and the VFTs are stabilized by a network of electrostatic interactions
between positively (red) and negatively (blue) charged residues. The disulfide
bond between ECL2 and TM3 is shown as yellow sticks. d, Distribution of distances
for ionic interactions between the stalk and VFT in GB1 and GB2 obtained from
molecular dynamics simulations of apo GABAB. e, IP1 production mediated by the
wild-type receptor (blue) or the construct used for cryo-EM (red) upon
stimulation with GABA (solid line) or SKF97541 (dotted line). f, Displacement of
the non-permeant antagonist CGP54626-DY647 by GABA (solid line) or SKF97541
(dotted line) from the wild-type receptor (WT, blue) or the construct used for
cryo-EM (EM, red). Data shown in e and f are normalized by the wild-type response
and presented as mean ± s.e.m. of four biologically independent experiments.
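
The data in Fig. 2e, f are described as normalized by the wild-type response and presented as mean ± s.e.m. of four independent experiments. As a rough sketch of that kind of summary, assuming that each experiment is scaled to its own wild-type reference value before averaging (the caption does not spell out the exact procedure, and the function name is hypothetical):

```python
import numpy as np

def normalized_mean_sem(responses, wild_type_reference):
    """Summarize replicate dose-response data as mean and s.e.m. in percent.

    responses: array of shape (n_experiments, n_concentrations), raw signal.
    wild_type_reference: array of shape (n_experiments,), the wild-type
        response used to scale each experiment (an assumed convention).
    """
    responses = np.asarray(responses, dtype=float)
    reference = np.asarray(wild_type_reference, dtype=float)[:, None]

    # Scale every experiment to its own wild-type reference (100%).
    percent = 100.0 * responses / reference

    mean = percent.mean(axis=0)
    # s.e.m. = sample standard deviation / sqrt(number of experiments).
    sem = percent.std(axis=0, ddof=1) / np.sqrt(percent.shape[0])
    return mean, sem
```

Normalizing within each experiment before averaging keeps day-to-day differences in absolute signal from inflating the error bars.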

Agonist binding results in two intermediate states that exist in a
dynamic equilibrium (Fig. 1, Supplementary Video 1). In the int-1 state,
the VFT of GB1 is partially closed at about 20% of the total amplitude,
as compared to the fully closed and fully open crystal structures. This
closure is accompanied by a rotation of both VFTs with respect to the
TMDs (Fig. 4a). At the same time, the two LB2s remain separated and
the TMDs face each other via TM5. In addition, the TM3-TM5 interaction
between the TMDs is probably released in the int-1 state, which
would result in a more dynamic conformation; this is underlined by
the substantially lower resolution of this structure.

The int-2 state shows a complete closure of the VFT of GB1, consistent
with the previous crystal structures of the agonist-bound VFTs.
This closure brings the two LB2s in contact, and decreases the distance
between their C-terminal ends from 42 to 32 Å (Fig. 4a). This conformational
rearrangement of the VFTs propagates through the stalks, which
leads to a relative motion of the TMDs that can be presented as a 30 Å
translation of the TMD of GB2 with respect to the stationary TMD of GB1
(Fig. 4a, b). As a result of these transformations, the two TMDs come in
direct contact through an interface along TM6 (Extended Data Fig. 4g),
consistent with previous cross-linking studies. A similar interaction
along TM6 has also previously been observed in the agonist-bound
mGlu5 homodimer structure. The conformational rearrangement
of the TMDs completely changes not only the heterodimer interface
(Fig. 4b), but also the overall dynamics of the receptor domains, as
observed in our molecular dynamics simulations and normal mode
analyses (Figs. 2d, 4c, Extended Data Fig. 6d, e).

Fig. 3 | Structure of the active-state GABAB, and details of agonist and PAM
binding. a, Overall view of GABAB model and map in the active agonist- and
PAM-bound conformation. Elongated density interacting with the intracellular
part of GB2 resembles the coiled-coil domain. b, Extracellular view of GABAB
TMD in the active conformation. c, Enlarged view of the ligand-binding pocket in
the VFT of GB1. SKF97541 activates GABAB by interacting with LB1 (blue) and LB2
(teal) of GB1. d, Binding of the PAM (GS39783) further stabilizes the GABAB active
state by interacting with TMD residues from both GB1 and GB2. SKF97541 and
GS39783 are shown as sticks with carbon atoms coloured in sand; oxygen in red;
nitrogen in blue; phosphorus in light green; and sulfur in dark green. e, IP1
production induced by 10 µM GS39783 in intact HEK293 cells that express the
indicated subunit combinations. Values are mean ± s.d. from seven biologically
independent experiments. Data are normalized by GABA response and analysed
using one-way analysis of variance (ANOVA) with Dunnett's multiple comparison
test to determine significance (compared with wild type), with ****P < 0.0001 for
all, except for GB1(M807A) (***P = 0.013). f, pKi values for GABA, determined from
displacement of CGP54626-DY647 binding in intact cells that express the
indicated subunit combinations in the absence or presence of 1 or 5 µM GS39783.
The number of biologically independent experiments is given in Extended Data
Table 2. Data were analysed using one-way ANOVA with Dunnett's multiple
comparison test to determine significance (compared with no GS39783 for the
same combined subunits). ****P < 0.0001; not significant (NS), P > 0.05. For 1 µM
or 5 µM GS39783, P = 0.9994 or 0.8166, respectively, for GB2(Y697A); 0.9672 or
0.8849, respectively, for GB2(N698A); 0.9934 or 0.4946, respectively, for
GB2(MYN-AAA); 0.5504 or 0.2972, respectively, for GB1(M807A); 0.9981 or
0.5843, respectively, for GB1(Y810A); 0.9605 or 0.7139, respectively, for
GB1(N811A); and 0.8819 or 0.9318, respectively, for GB1(MYN-AAA). In
e, f, GB1(MYN-AAA) refers to a triple mutant (M807A/Y810A/N811A) of GB1, and
GB2(MYN-AAA) refers to a triple mutant (M694A/Y697A/N698A) of GB2.
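
Fig. 3f reports pKi values obtained from displacement of the fluorescent antagonist CGP54626-DY647. The conversion used by the authors is not stated here; one common way to turn a measured IC50 from a competition experiment into a Ki is the Cheng-Prusoff relation, Ki = IC50 / (1 + [L]/Kd), with pKi = -log10(Ki). The sketch below assumes that relation and uses hypothetical parameter names; all concentrations are in mol/L.

```python
import math

def pki_from_ic50(ic50_molar, tracer_molar, tracer_kd_molar):
    """pKi from a competition-binding IC50 via the Cheng-Prusoff relation.

    Assumes simple competitive binding at a single site; the tracer
    concentration and its Kd must be known from a separate experiment.
    """
    if ic50_molar <= 0 or tracer_kd_molar <= 0 or tracer_molar < 0:
        raise ValueError("concentrations must be positive (tracer may be zero)")
    # Correct the IC50 for the fraction of sites occupied by the tracer.
    ki = ic50_molar / (1.0 + tracer_molar / tracer_kd_molar)
    return -math.log10(ki)
```

On this scale a larger pKi means tighter binding, so an increase of one unit corresponds to a tenfold lower Ki.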


PAM stabilizes the active-state conformation 

The addition of a PAM along with the orthosteric agonist stabilized
the receptor conformation in a state similar to int-2 (r.m.s.d. = 1.0 Å),
which resulted in a reconstruction with an overall resolution of 3.6 Å
(Fig. 3a) and enabled us to observe activation-related transitions within
the TMD of GB2.

Fig. 4 | Activation-related transitions in GABAB. a, Overall view of apo GABAB,
indicating the conformational changes from inactive apo through two
intermediate agonist-bound to active agonist- and PAM-bound states.
Activation, initiated by the agonist binding and a closure of the VFT of GB1
(blue arrows), propagates to large-scale translations of the TMDs in the int-2
state (green arrows). PAM binding induces an additional conformational
change within the TMD of GB2 (magenta arrows). b, Extracellular view of the
GABAB TMDs in the inactive and active states aligned by the GB1 TMD
demonstrates an approximately 30 Å (black arrows) relative movement of the
GB2 TMD. During this transition, the interface between TMDs changes from
TM5 (inactive state) to TM6 (active state); the TM6-TM6 distance (red dash)
decreases from 30.3 Å to 8.4 Å and the TM5-TM5 distance (olive dash) increases
from 13.1 Å to 37 Å, consistent with previously published cross-linking data.
c, Distribution of the TM5-TM5 and TM6-TM6 distances in the inactive apo and
active agonist- and PAM-bound states as observed in molecular dynamics
simulations. d, Shifts of TM3 and TM5 by about 6 Å on the intracellular side of
GB2 upon agonist and PAM binding open a cleft for the engagement of a
G protein. e, Cartoon illustrates activation-related transitions from the inactive
apo state, through the agonist-bound intermediate states, to the agonist- and
PAM-bound active state. In the inactive state, both VFTs adopt fully open
conformations, and their LB2s are well-separated. Agonist binding closes the
VFT of GB1, bringing the LB2s in contact. These transformations at the level of
VFTs propagate through the stalks towards the TMDs, resulting in their mutual
reorientation and formation of a contact interface along TM6. PAM binding
further stabilizes the active state, leading to substantial shifts of the
intracellular ends of TM3 and TM5, which open up a cleft for the engagement of
G proteins or other transducers. EC, extracellular; IC, intracellular.

The high-resolution structure also revealed the binding details of the
orthosteric agonist SKF97541, a close analogue of GABA in which the
carboxylic acid moiety is replaced by methylphosphinic acid. Similar to
the interactions of GABA in its complex with the VFT of GB1 (PDB 4MS3),
binding of SKF97541 is largely defined by its acidic and basic groups.
The basic amino moiety of SKF97541 forms polar and ionic interactions
with H287 and E466, and the phosphinic acid of SKF97541 makes
key hydrogen bonds to S247, S270 and Y367 (Fig. 3c, Supplementary
Video 2). The ligand is also sandwiched between the bulky aromatic
side chains of W182 and W395. Our molecular dynamics simulations
confirmed the stability of these interactions (Extended Data Fig. 6f, j).
Most residues interacting with SKF97541 and GABA belong to LB1 of
the VFT of GB1, but two of them, Y367 and W395, reside on LB2 and
shift by about 2 Å between the int-1 and int-2 states.

The most substantial differences between the agonist-bound (int-2)
structure and the agonist- and PAM-bound structure occur within
the TMD of GB2, in which TM3 and TM5 straighten and shift on the
intracellular side by about 6 Å whereas TM6, which is considered a
hallmark of activation in class-A and -B GPCRs, does not move
substantially (Fig. 4a, d). By contrast, the conformation of the TMD of GB1
remains mostly unchanged (Extended Data Fig. 7c-f). Notably, the
first three structures of the activation cycle (the apo, int-1, and int-2
states) show the same conformation of TM3 and TM5 in the TMD of
GB2. The ionic lock between a lysine and an aspartate residue, which is
highly conserved among class-C GPCRs and is observed in all inactive mGlu
crystal structures, is intact in GB1 but broken in GB2 in the PAM-bound state.
These conformational changes in the TMD of GB2 are most probably
required to accommodate binding of the Gi/o protein for the initiation
of intracellular signalling. This mechanism is further supported by
the local resolution and B-factor comparison of GB2 (Extended Data
Fig. 4e, f), which demonstrates that the intracellular ends of TM3,
TM4 and TM5 are the most dynamic parts of the structure. Directly
connected to these helices, the intracellular loop 2 (ICL2) and ICL3
of GB2 have previously been identified as crucial for downstream
signalling.
PAM binding site at the heterodimer interface


GS39783 has previously been characterized as a PAM of GABAB; however,
little is known about its binding site and its mechanism of action.
We confirmed a positive allosteric effect of GS39783 on both ligand
binding and the signalling response of GABAB. Additionally, using
highly amplified inositol monophosphate (IP1) assays, we observed
that GS39783 alone behaves as an agonist (Extended Data Fig. 1c, d,
Extended Data Table 2). After building and refining the agonist- and
PAM-bound GABAB model, we identified two putative binding sites for
GS39783 with unexplained residual density. Site 1 is located inside the
TMD of GB2 and site 2 is observed at the interface between the TMDs of
GB1 and GB2 (Fig. 3d, Supplementary Video 3). Mutations of residues
in site 1 inside the TMD of GB2 did not affect either the allosteric or
the agonist action of GS39783, which suggests that this is not the main
binding site for GS39783 (Extended Data Figs. 8, 9).

On the other hand, the mutation of residues in site 2, which is located 
at the TM6 heterodimer interface, unambiguously confirmed it as the 
site of action for GS39783 (Fig. 3e, f, Extended Data Figs. 8, 9). TM6 has 
a major role in the activation of GABAB as the sole interface between 
TMDs, and the binding of GS39783 in between the TM6s can stabilize 
this interaction. In this site, GS39783 fits well in the experimental 
density and is anchored by a hydrogen bond with N698 of GB2 and a 
stacking interaction with Y810 of GB1 (Fig. 3d). Other residues that 
shape the pocket and interact with GS39783 include Y789, K792 
and M807 of GB1, as well as M694 and Y697 of GB2. These interactions 
are largely preserved in the molecular dynamics simulations 
of the complex (Extended Data Fig. 6f, i). 

We therefore propose that GS39783 has a dual role in GABAB activation 
by stabilizing the TM6-mediated dimerization interface and 
facilitating conformational transitions within the TMD of GB2. 


Discussion 


In this work, we have determined the 3D structure of the pharmacologically 
important GABAB in four distinct conformations along its activation 
trajectory from an inactive apo to an agonist- and PAM-stabilized 
active state (Supplementary Video 1), enabling us to propose a model 
of signal transduction by GABAB and its modulation by a PAM (Fig. 4e). 
Upon binding of an agonist, a compaction of the VFTs brings the two 
TMDs closer together and reorients them to allow direct contact along 
TM6, consistent with the proposed model based on TMD cross-linking 
experiments (Fig. 4b). Overall, the observed domain rearrangements 
associated with the activation of GABAB resemble those in the previously 
published structures of mGlu5, which suggests that this activation 
mechanism is probably conserved among class-C GPCRs. However, there are 
three major differences between mGlu receptors and GABAB. First, GABAB is 
a mandatory heterodimer of two noncovalently associated subunits, 
whereas mGlu receptors function as homodimers with two identical 
subunits linked via a disulfide bond. Second, although GABA binding in a 
single subunit (GB1) is sufficient to fully activate the receptor, glutamate 
binding in both mGlu subunits is required for full activation. Third, 
GABAB lacks a cysteine-rich domain, which is present in each mGlu 
subunit and connects the VFT with the TMD, increasing the overall size 
of the receptor. Notably, our GABAB structures revealed the presence 
of a relatively rigid stalk composed of the ECL2 hairpin and the linker 
connecting the VFT with the TMD. Similar to the cysteine-rich domain in 
mGlu receptors, this stalk probably transduces the signal from the 
VFTs to the TMDs, acting as a lever. 

In contrast to the mGlu5 study (in which no conformational changes 
within the TMDs were reported), we succeeded in reaching an additional 
activation-related step and captured a PAM-stabilized state of GABAB 
with marked changes within the TMD of GB2. Unlike class-A and -B 
GPCRs (in which receptor activation is associated with a pronounced 
outward movement of TM6), the amplitude of TM6 motion in GABAB 
is constrained by the heterodimeric interface between two subunits. 
Instead, we observed notable movements of TM3, TM4 and TM5 that 
opened a cleft on the intracellular side of GB2, potentially to engage 
a G protein or other transducers. This conformational change may be 
favoured by the interaction between TM6 from each subunit resulting 
from the reorientation of the TMDs upon receptor activation, which 
possibly explains the role proposed for the TMD of GB1 in the activation 
process. We identified the location of a binding site for GS39783 at 
the interface between the TMDs of the two subunits. GABAB PAMs can 
be attractive alternatives to orthosteric agonists owing to their 
therapeutic potential; however, the lack of knowledge about their binding 
and mechanism of action has hindered progress in their development. 

Our results provide detailed insights into the structural transformations 
that are associated with the activation of GABAB, shedding light on 
the intrinsically asymmetric allosteric mechanism in which the binding 
of a ligand in the VFT of GB1 translates via the stalks into marked 
rearrangements of the TMD of GB2. Such an asymmetric transition 
underlines the unique activation process of GABAB. Finally, the structural 
differences between each of the states give important insights into 
pharmacological intervention at this receptor, and could potentially 
result in the design of new allosteric modulators. 
Methods 


No statistical methods were used to predetermine sample size. The 
experiments were not randomized and investigators were not blinded 
to allocation during experiments and outcome assessment. 


Expression of GABAB 
This study used two separate constructs with cleavable N- and 
C-terminal tags for GB1 and GB2. Human GB1 isoform 1A (UniProt: 
Q9UBS5, residues 165-920) was modified to include an N-terminal 
influenza haemagglutinin signal sequence and a C-terminal eGFP tag. 
Additionally, human GB2 (UniProt: O75899, residues 42-821) was 
modified to include an N-terminal influenza haemagglutinin signal sequence 
followed by a 3× Flag epitope. The N- and C-terminal tags are removable 
by tobacco etch virus protease (TEV) and human rhinovirus 3C protease 
(PreX), respectively. The overall strategy of designing the constructs 
was to keep the topology of the VFT and coiled-coil domain intact and 
also to allow the heterodimeric species to be pulled down. 
Each construct was cloned into a pFastBac1 vector (Invitrogen) and 
expressed in Spodoptera frugiperda Sf9 insect cells (ATCC CRL-1711, 
authenticated by supplier using morphology and growth characteristics, 
certified mycoplasma-free) using the Bac-to-Bac system (Invitrogen). 
Cells were co-infected with baculovirus at a density of 2.5-3 × 10⁶ 
cells ml⁻¹ in ESF921 medium containing 2% (v/v) production boost 
additive (PBA, Expression Systems) at a multiplicity of infection (MOI) of 8 
for both GB1 and GB2 constructs. After 48 h cells were collected, washed 
in PBS, and stored at -80 °C until further use. 
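
As a rough illustration of the infection arithmetic above, the volume of baculovirus stock needed for a given MOI follows from the cell count and the viral titre. The sketch below is not taken from the paper: the titre (`titer_pfu_per_ml`) and the 1-litre culture volume are hypothetical placeholders, and only the cell density and the MOI of 8 come from the text.

```python
def virus_volume_ml(cell_density_per_ml, culture_volume_ml, moi, titer_pfu_per_ml):
    """Volume of virus stock (ml) that delivers `moi` infectious units per cell."""
    if titer_pfu_per_ml <= 0:
        raise ValueError("titer must be positive")
    total_cells = cell_density_per_ml * culture_volume_ml
    return moi * total_cells / titer_pfu_per_ml


if __name__ == "__main__":
    # Density (2.5e6 cells per ml) and MOI (8) are from the text; the culture
    # volume (1000 ml) and titre (1e9 pfu per ml) are illustrative assumptions.
    for construct in ("GB1", "GB2"):
        volume = virus_volume_ml(2.5e6, 1000.0, 8, 1e9)
        print(f"{construct}: {volume:.1f} ml of virus stock")
```

Because both constructs are used at the same MOI, the same volume applies to each virus only if the two stocks happen to have the same titre.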


Purification of GABAB 

Cells were thawed in low salt wash buffer containing 20 mM 
HEPES-NaOH pH 7.5, 10 mM MgCl2, 20 mM KCl and protease inhibitor 
cocktail (made in-house). One round of Dounce homogenization 
followed by centrifugation at 200,000g was carried out to disrupt intact 
cells and to remove unwanted soluble proteins. Further homogenization 
and centrifugation were done in the presence of 1 M NaCl for 
two rounds to remove cell nuclei and membrane-associated proteins. 
The washed membranes were resuspended and solubilized in buffer 
containing 100 mM HEPES-NaOH pH 7.5, 800 mM NaCl, 10% (v/v) glycerol, 
1% (w/v) n-dodecyl-β-D-maltopyranoside (DDM, Anatrace), 0.2% 
(w/v) cholesteryl hemisuccinate (CHS, Sigma-Aldrich) for 2.5 h at 4 °C. 
Purification was carried out in the presence of 50 µM SKF97541 (Tocris 
Bioscience) for preparation of the agonist-bound sample, and without 
adding ligands for the agonist- and PAM-bound and apo samples. 
Solubilized membranes were spun down at 200,000g for 50 min, and the 
resulting supernatant was incubated with anti-Flag M2 affinity gel 
(MilliporeSigma) overnight at 4 °C. The resin was washed with 15 column 
volumes of wash buffer I containing 25 mM HEPES-NaOH pH 7.5, 300 mM 
NaCl, 0.1% (w/v) DDM, 0.02% (w/v) CHS and incubated overnight in the 
same buffer with the addition of 100 µg ml⁻¹ Flag peptide and TEV 
protease (GenScript) at 4 °C. The eluted protein was then incubated with 
anti-GFP nanobody resin (made in-house) and washed with 5 column 
volumes of wash buffer I and 10 column volumes of wash buffer II containing 
25 mM HEPES-NaOH pH 7.5, 300 mM NaCl, 0.02% (w/v) DDM, 0.004% 
(w/v) CHS and exchanged to digitonin (MilliporeSigma) by incubating 
the resin with the same buffer containing 1% digitonin for 1 h. Elution 
was performed overnight by incubating the resin with PreX protease 
(GenScript) at 4 °C. The protein was finally purified by size-exclusion 
chromatography (SEC) on a Superose 6 Increase 10/300 GL column 
(GE Healthcare) in SEC buffer containing 25 mM HEPES-NaOH pH 7.5, 
300 mM NaCl, 0.1% (w/v) digitonin. Dimeric fractions were pooled, 
supplemented with 100 µM SKF97541 (for the agonist sample) or with 
100 µM SKF97541 and 100 µM GS39783 (for the agonist and PAM sample, 
Tocris Bioscience), stored at 4 °C overnight and concentrated to 
10 mg ml⁻¹ using a 100-kDa cut-off concentrator (Amicon) immediately 
before applying the sample to cryo-EM grids. 


Cryo-EM data acquisition and processing 
For the preparation of cryo-EM grids, Quantifoil grids (Au 1.2/1.3, 200 
mesh, Quantifoil) were glow-discharged for 40 s at 30 mA (EasiGlo, Ted 
Pella). A total of 2.5 µl of concentrated GABAB sample was applied to 
each grid in a plunge-freezer (Leica GP3), blotted for 5 s at 95% relative 
humidity at 20 °C and plunge-frozen into liquid ethane. Frozen grids 
were transferred to liquid nitrogen and stored for data acquisition. 
The first two datasets (agonist- and agonist- and PAM-bound receptor) 
described in this study were collected on a Titan Krios (Thermo 
Fisher), equipped with a K3 direct-electron detector (Gatan), without 
energy filter, operated at 300 kV. Each movie had a total exposure time 
of 3.5 s with 70 ms per frame readout, at a dose rate of about 15 e Å⁻² s⁻¹, 
corresponding to a total dose of about 50 e Å⁻². High-magnification 
images were collected in super-resolution mode, with a corresponding 
pixel size of 0.426 Å. Automated data collection was performed 
using SerialEM, with a strategy of applying image shifts to collect 
data from 9 holes (with 3 images per hole) after each stage movement, 
to increase throughput. The third, apo dataset was collected 
on a Titan Krios (Thermo Fisher) equipped with an energy filter and 
K2 direct-electron detector (Gatan), operated at 300 kV. The data 
collection procedure was similar to the previous two datasets, with the 
difference of a nominal pixel size of 1.08 Å, a dose rate of about 8 e Å⁻² s⁻¹, 
200-ms exposure time per frame, a total of 8-s exposures with the 
energy filter set to 20 eV slit size and only one acquisition per hole. 
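
The acquisition parameters above are linked by simple arithmetic, sketched below as a quick consistency check. The function names are illustrative only; the input numbers are those quoted in the text, and the total dose for the K2 (apo) dataset is derived here rather than stated by the authors.

```python
def total_dose(dose_rate, exposure_s):
    """Total dose (e per square angstrom) = dose rate (e per square angstrom per s) x exposure (s)."""
    return dose_rate * exposure_s


def frame_count(exposure_s, frame_s):
    """Number of movie frames; rounding absorbs floating-point noise."""
    return round(exposure_s / frame_s)


def binned_pixel_size(pixel_size_angstrom, binning):
    """Pixel size (angstrom) after binning by an integer factor."""
    return pixel_size_angstrom * binning


if __name__ == "__main__":
    # K3 datasets (values from the text)
    print("K3 total dose:", total_dose(15, 3.5))          # text: "about 50"
    print("K3 frames per movie:", frame_count(3.5, 0.070))
    print("K3 pixel after 2x binning:", binned_pixel_size(0.426, 2))
    print("K3 movies per stage move:", 9 * 3)             # 9 holes x 3 images
    # K2 apo dataset (total dose derived, not quoted in the text)
    print("K2 total dose:", total_dose(8, 8.0))
    print("K2 frames per movie:", frame_count(8.0, 0.200))
```

The binned K3 pixel size from this calculation (about 0.852 Å) agrees with the value listed in Extended Data Table 1.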
The first dataset (SKF97541-bound GABAB) had a total of 14,271 
micrographs (3.5 d of data collection), the second dataset (SKF97541- and 
GS39783-bound GABAB) of 10,917 micrographs (2.5 d of data collection) 
and the third dataset (apo GABAB) 5,947 micrographs (2 d of data 
collection). Beam-induced motion correction was performed using 
MotionCor2 and included binning of the super-resolution images 
by a factor of 2 for the first two K3 datasets. CTF estimation was done 
using CTFFIND4. Corrected images for the first dataset were imported 
into the RELION 3.0 software package, following established processing 
pipelines. After initial template-based autopicking, 2D classification 
resulted in classes from two distinctly different conformations 
of GABAB (Extended Data Fig. 2). Three-dimensional classification in 
RELION was used to separate particles into two classes. The particles 
from each class were then transferred to the cryoSPARC software 
package for detergent belt subtraction, followed by several rounds of 
ab-initio reconstruction, heterogeneous refinement, non-uniform 
refinement and local refinement, resulting in two reconstructions at 
6.3 and 4.8 Å (Extended Data Fig. 2). The second dataset (SKF97541- and 
GS39783-bound GABAB) was directly imported into cryoSPARC; 2D 
classification resulted in 124,575 particles. A final set of 89,001 particles 
resulted in a reconstruction of 3.6 Å resolution, based on the Fourier 
shell correlation (FSC) = 0.143 criterion (Extended Data Figs. 4, 5g). The 
third dataset (GABAB apo) was also only processed with cryoSPARC, 
with a final set of 113,093 particles resulting in a reconstruction at 4.0 Å 
resolution (Extended Data Fig. 5). 
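
To make the FSC = 0.143 criterion concrete, the sketch below reads a resolution off an FSC curve by linear interpolation at the first crossing of the threshold. It is a generic illustration, not the routine used by cryoSPARC or RELION; the function name and the synthetic curve are assumptions made for the example.

```python
import numpy as np


def resolution_at_threshold(spatial_freq, fsc, threshold=0.143):
    """Resolution (angstrom) at which the FSC curve first drops below `threshold`.

    `spatial_freq` is in reciprocal angstrom and must be increasing and positive.
    Returns None if the curve never drops below the threshold.
    """
    freq = np.asarray(spatial_freq, dtype=float)
    curve = np.asarray(fsc, dtype=float)
    below = np.nonzero(curve < threshold)[0]
    if below.size == 0:
        return None
    i = int(below[0])
    if i == 0:
        raise ValueError("FSC is already below the threshold at the lowest frequency")
    # Linear interpolation between the last point above and the first point below
    f0, f1 = freq[i - 1], freq[i]
    c0, c1 = curve[i - 1], curve[i]
    f_cross = f0 + (c0 - threshold) * (f1 - f0) / (c0 - c1)
    return 1.0 / f_cross


if __name__ == "__main__":
    # Synthetic curve for illustration only (not data from the paper)
    freq = [0.10, 0.20, 0.30, 0.40]
    fsc = [1.00, 0.80, 0.20, 0.00]
    print(f"{resolution_at_threshold(freq, fsc):.2f} angstrom")
```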


GABAB structure determination and refinement 

The initial model for the agonist- and PAM-bound GABAB was composed 
of a TMD model generated by homology from the crystal structure of 
mGlu1 (PDB 4OR2) using SWISS-MODEL and a VFT model derived from 
the previously published crystal structure of the GABAB VFT in complex 
with GABA (PDB 4MS3). Both models were first manually docked into 
the cryo-EM density map using UCSF ChimeraX and further improved 
by using phenix.dock_in_map in PHENIX v.1.17.1. The pseudosymmetry 
of the GABAB heterodimer was addressed by both docking known VFT 
crystal structures and pinpointing asymmetries in the glycosylation 
sites, most prominently seen on N440 of GB1. The model was then 
subjected to iterative manual building in Coot and real-space refinement 
in PHENIX against the sharpened 3.6 Å resolution map using global 
minimization, local rotamer fitting and restrained group ADP refinement 
with secondary structure restraints and a non-bonded weight of 500. 
In this structure, we observed an extensive elongated density next to 
ICL2 of the TMD of GB2, stretching out in the intracellular space at an 
angle of about 20° to the plane of the membrane (Fig. 3a). Owing to the 
poor local resolution, it was not possible to unambiguously model this 
domain. However, based on its topology, we tentatively assigned it to 
the intracellular coiled-coil domain. 

The fully refined agonist- and PAM-bound GABAB TMDs along 
with a VFT model derived from the crystal structure of the GABAB VFT 
in the apo state (PDB 4MQE) were used as an initial template for 
the apo state model by fitting into the cryo-EM density map and 
further improved by iterative manual building in Coot and real-space 
refinement in PHENIX against the sharpened 4.0 Å resolution map 
using global minimization, local rotamer fitting and restrained ADP 
refinement with secondary structure restraints and a non-bonded 
weight of 500. 

The initial model for the int-1 state was generated by fitting two 
TMDs from the fully refined apo GABAB structure, as well as LB1 and 
LB2 of the VFT of GB1 and a whole VFT of GB2 from the crystal 
structure (PDB 4MS3) in the map in ChimeraX followed by rigid-body 
refinement in PHENIX. After manual adjustments in Coot, which 
included connecting separate domains and trimming most side 
chains and intracellular loops in TMDs, the model was subjected to 
real-space refinement in PHENIX against the sharpened 6.3 Å 
resolution int-1 map using global minimization, local rotamer fitting and 
restrained group ADP refinement with secondary structure and 
reference model (apo structure) restraints and a non-bonded weight of 
500. Although the agonist is probably bound to the receptor in the 
int-1 state, it was not included in the model because of insufficient 
resolution. 

The initial model for the int-2 state was generated by fitting two 
TMDs from the fully refined apo GABAB, as well as two VFTs from the 
GABA-bound crystal structure (PDB 4MS3) in the map in ChimeraX 
followed by rigid-body refinement in PHENIX. After manual adjustments 
in Coot, which included connecting separate domains and trimming 
some side chains and intracellular loops in the TMDs, the model was 
subjected to real-space refinement in PHENIX against the sharpened 
4.8 Å resolution int-2 map using global minimization, local rotamer 
fitting and restrained group ADP refinement with secondary structure 
and reference model (agonist- and PAM-bound structure) restraints and 
a non-bonded weight of 500. Cryo-EM data collection and refinement 
statistics are shown in Extended Data Table 1. 


Molecular dynamics simulations 

All molecular dynamics simulations were conducted with the Gromacs 
v.2018 simulation engine under Charmm36 force field parameters and 
topologies. Missing loops and side chains were modelled using loop 
modelling and optimization tools in ICM-Pro v.3.8.7b. The resulting 
structures were then uploaded to the Charmm-GUI webserver to 
generate input files for the simulation. All structures were embedded into a 
bilayer of POPC lipids; initial membrane coordinates were assigned by 
the PPM server via the Charmm-GUI interface. The 'Ligand Reader & 
Modeller' of Charmm-GUI was invoked to generate Charmm36 force 
field parameters and topologies for GS39783 and SKF97541. The apo 
system contained 375 POPC lipids, 61,663 water molecules, 167 sodium 
and 186 chloride ions, and the ligand-bound system contained 385 
POPC lipids, 66,825 water molecules, 181 sodium and 200 chloride ions. 
After initial energy minimizations, both systems were equilibrated for 
20 ns, followed by production runs of up to 800 ns for the apo system 
and 500 ns for the SKF97541- and GS39783-bound system. The 
simulations were performed either on NVIDIA P100 GPU enabled nodes 
made available by the Google Cloud Platform or with GPU clusters at 
the High-Performance Computing Center of the University of 
Southern California. Molecular dynamics trajectories were analysed using 
the MDAnalysis package. 
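
The system compositions quoted above can be sanity-checked with a few lines of arithmetic. The sketch below estimates the salt concentration from the ion and water counts and reports the net charge carried by the ions. The water molarity of 55.5 M is an approximation, and reading the ion imbalance as the counter-charge of the solute assumes that the boxes were neutralized, which the text does not state explicitly.

```python
WATER_MOLARITY = 55.5  # mol per litre, approximate value for liquid water

# Counts taken from the text above
SYSTEMS = {
    "apo": {"popc": 375, "water": 61663, "na": 167, "cl": 186},
    "agonist- and PAM-bound": {"popc": 385, "water": 66825, "na": 181, "cl": 200},
}


def salt_molarity(n_ion_pairs, n_water):
    """Approximate salt concentration (mol per litre) from particle counts."""
    return n_ion_pairs / n_water * WATER_MOLARITY


def summarize(counts):
    """Salt molarity from paired ions, plus the net charge of all ions."""
    pairs = min(counts["na"], counts["cl"])
    return {
        "salt_M": salt_molarity(pairs, counts["water"]),
        "net_ion_charge": counts["na"] - counts["cl"],
    }


if __name__ == "__main__":
    for name, counts in SYSTEMS.items():
        info = summarize(counts)
        print(f"{name}: {info['salt_M']:.3f} M salt, "
              f"net ion charge {info['net_ion_charge']:+d}")
```

Both boxes come out close to 0.15 M salt with the same net ion charge, which is what one would expect if the two systems were prepared with the same protocol.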


Normal mode analysis 

Normal modes of the apo and SKF97541- and GS39783-bound 
structures were assessed using the ProDy package. Two independent 
anisotropic network models (ANM), with respect to apo and SKF97541- 
and GS39783-bound structures, were defined in the presence of an 
explicit POPC lipid bilayer. Coordinates of the lipids were taken from 
Charmm-GUI output. Network nodes include Cα atoms from the 
proteins and heavy atoms from the lipids. The slowest modes of each ANM 
model were compared to the principal components of corresponding 
molecular dynamics trajectories by evaluating correlations between 
eigenvectors on corresponding Cα atoms. 
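
For readers unfamiliar with anisotropic network models, the following is a minimal NumPy sketch of the idea: nodes within a cutoff are joined by identical springs, the resulting Hessian is diagonalized, and the slowest non-trivial modes can be compared with another set of vectors (for example, principal components from a trajectory) through absolute dot products. This is not the ProDy implementation used in the paper; the 15 Å cutoff, the unit spring constant, the overlap measure and the random test coordinates are all assumptions, and lipid nodes are omitted.

```python
import numpy as np


def anm_hessian(coords, cutoff=15.0, gamma=1.0):
    """Build the 3N x 3N ANM Hessian for N nodes (coordinates in angstrom)."""
    xyz = np.asarray(coords, dtype=float)
    n = len(xyz)
    hessian = np.zeros((3 * n, 3 * n))
    for i in range(n):
        for j in range(i + 1, n):
            d = xyz[j] - xyz[i]
            r2 = float(d @ d)
            if r2 > cutoff ** 2:
                continue  # nodes too far apart: no spring
            block = -gamma * np.outer(d, d) / r2
            hessian[3 * i:3 * i + 3, 3 * j:3 * j + 3] = block
            hessian[3 * j:3 * j + 3, 3 * i:3 * i + 3] = block
            hessian[3 * i:3 * i + 3, 3 * i:3 * i + 3] -= block
            hessian[3 * j:3 * j + 3, 3 * j:3 * j + 3] -= block
    return hessian


def slowest_modes(hessian, n_modes=10, tol=1e-8):
    """Eigenvalues and eigenvectors of the slowest non-zero modes."""
    evals, evecs = np.linalg.eigh(hessian)
    keep = evals > tol  # drops the six rigid-body modes
    return evals[keep][:n_modes], evecs[:, keep][:, :n_modes]


def mode_overlap(modes_a, modes_b):
    """Absolute dot products between two sets of unit-norm column vectors."""
    return np.abs(modes_a.T @ modes_b)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    toy_coords = rng.uniform(0.0, 10.0, size=(20, 3))  # toy nodes, not a protein
    values, vectors = slowest_modes(anm_hessian(toy_coords))
    print("slowest eigenvalues:", np.round(values[:3], 3))
```

In the workflow described above, the second argument to `mode_overlap` would be the principal components of the Cα motions from the corresponding trajectory, restricted to the same Cα atoms.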


Molecular pharmacology methods 

The pRK5 plasmids encoding the wild-type human GB1, with the signal 
peptide of mGlu5, followed by a HA tag and a Halo tag at the N terminus, 
upstream of residue G18, or the wild-type human GB2 with the signal 
peptide of mGlu5, a Flag tag followed by SNAP tag at the N terminus, 
upstream of W42, were obtained from the laboratory of J. Liu. Human 
GB1 and GB2 constructs used for cryo-EM analysis were subcloned 
into these pRK5 plasmids to obtain a HA- and Halo-tagged GB1 
construct (between the Mlu-I and Hind-III unique sites), and the Flag- and 
SNAP-tagged GB2 construct (between the Not-I and Hind-III unique 
sites). The mutations of GB1 and GB2 in the pRK5 plasmids were 
generated by site-directed mutagenesis using the QuikChange mutagenesis 
protocol (Agilent Technologies). 

HEK293 cells (ATCC CRL-1573, authenticated by supplier using 
morphology and growth characteristics, certified mycoplasma-free) 
were cultured in Dulbecco's modified Eagle's medium (DMEM) 
supplemented with 10% FBS and transfected by electroporation. Ten million 
cells were transfected with 260 V voltage and 800 µF capacitance using 
the Gene Pulser Xcell MicroPulser Electroporator (Bio-Rad), and then 
distributed into a 96-well plate (Greiner Bio-one). Alternatively, 
Lipofectamine 2000 (Thermo Fisher Scientific) was used according to the 
manufacturer's protocol for transfection. One hundred thousand cells 
were transfected with 20 ng GB1 plasmid and 40 ng GB2 plasmid either 
alone or simultaneously for 1 well in a 96-well plate. For IP1 detection, 
cells were also co-transfected with 10 ng chimeric G protein Gαqi9 to 
allow efficient coupling of the receptor to the phospholipase C 
pathway. IP1 accumulation in HEK293 cells was measured using the IP-One 
HTRF kit (Cisbio) according to the manufacturer's recommendations. 


Ligand-binding assay 

HEK293 cells were co-transfected with pRK5 plasmids encoding 
wild-type or mutant GB1 and GB2 in 96-well plates. Intact cells were 
washed once with Tag-lite buffer (Cisbio) 48 h later and incubated with 
both 5 nM CGP54626-DY647 (Cisbio) and increasing concentrations of 
GABA for 3 h at 4 °C in Tag-lite buffer. After three washes with Tag-lite 
buffer, fluorescence was measured as the specific DY647 emission 
spectrum (665 nm) with an Infinite F500 reader (Tecan). 
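
The affinities and potencies derived from these assays are reported on a negative log10 scale (pKi, pEC50) in Extended Data Table 2. As a reading aid, the sketch below converts such values to molar concentrations and expresses a PAM-induced change as a fold shift. The helper names are illustrative; the numbers are the wild-type (WT + WT) entries from Extended Data Table 2, and the fold shift is derived here rather than quoted by the authors.

```python
def p_to_molar(p_value):
    """Convert a pKi or pEC50 value to a molar concentration."""
    return 10.0 ** (-p_value)


def fold_shift(p_with_pam, p_without_pam):
    """Fold increase in affinity or potency implied by a change in p-value."""
    return 10.0 ** (p_with_pam - p_without_pam)


if __name__ == "__main__":
    # Wild-type pKi of GABA without GS39783 and with 5 uM GS39783
    pki_0, pki_5 = 5.44, 5.92
    print(f"Ki without PAM: {p_to_molar(pki_0) * 1e6:.2f} uM")
    print(f"Ki with 5 uM PAM: {p_to_molar(pki_5) * 1e6:.2f} uM")
    print(f"fold shift: {fold_shift(pki_5, pki_0):.2f}")
    # Wild-type pEC50 of GABA in the IP1 assay
    print(f"EC50: {p_to_molar(6.19) * 1e6:.2f} uM")
```

A difference of about 0.5 log units therefore corresponds to roughly a threefold increase in apparent GABA affinity in the presence of the PAM.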


Cell-surface quantification of receptor with SNAP tag or Halo tag 

For SNAP tag and Halo tag detection, HEK293 cells from 96-well plates 
were incubated at 37 °C with 100 nM of SNAP-Lumi4-Tb or 100 nM of 
Halo-Lumi4-Tb for 1 h, 48 h after transfection. After labelling, cells were 
washed three times with Tag-lite buffer (Cisbio), and fluorescence of 
Lumi4-Tb (excitation at 337 nm, emission at 620 nm, 60 µs delay, and 
400 ms integration time) was read using an Infinite F500 reader (Tecan). 


Figures and graphical illustration 

PyMOL v. 2.3.3 (Schrödinger), UCSF ChimeraX v. 0.9, and ICM-Pro v. 
3.8.7b (Molsoft) were used to make figures and videos. All cryo-EM maps 
shown in the manuscript were masked to hide densities arising from 
the detergent belt. All reported r.m.s.d. values were calculated using 
the align command in PyMOL. The distances between the C-terminal 
ends of the GB1 and GB2 VFTs in Extended Data Fig. 7a, b were 
measured between Cα atoms of residues D576 (cryo-EM models) or D459 
(X-ray models) in GB1 and D466 (cryo-EM and X-ray models) in GB2. 
The TM5-TM5 and TM6-TM6 distances in Fig. 4b, c were measured 
between Cα atoms of residues G771 and V822 in GB1 and residues 
G658 and V709 in GB2 to compare with cross-linking data. The 
shifts of TM3 and TM5 of GB2 in Fig. 4d were measured for Cα atoms 
of residues I581 and T678. The GABAB heterodimer interface areas 
in Extended Data Figs. 4g, 5i were calculated using the program PISA. 
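
The two kinds of measurement described above (a distance between two Cα atoms, and an r.m.s.d. after superposition) can be sketched in a few lines of NumPy. This is a simplified stand-in, not the procedure used in the paper: PyMOL's align command also performs a sequence alignment and iterative outlier rejection, whereas the Kabsch superposition below assumes that the two coordinate sets are already matched atom by atom. The coordinates in the example are synthetic.

```python
import numpy as np


def ca_distance(atom_a, atom_b):
    """Euclidean distance (angstrom) between two atoms given as xyz triples."""
    return float(np.linalg.norm(np.asarray(atom_a, float) - np.asarray(atom_b, float)))


def kabsch_rmsd(mobile, target):
    """R.m.s.d. between two matched N x 3 coordinate sets after optimal superposition."""
    p = np.asarray(mobile, dtype=float)
    q = np.asarray(target, dtype=float)
    if p.shape != q.shape or p.ndim != 2 or p.shape[1] != 3:
        raise ValueError("both inputs must be N x 3 arrays of the same shape")
    p = p - p.mean(axis=0)  # remove translation
    q = q - q.mean(axis=0)
    u, _, vt = np.linalg.svd(p.T @ q)
    # Guard against an improper rotation (reflection)
    sign = 1.0 if np.linalg.det(vt.T @ u.T) >= 0 else -1.0
    rotation = vt.T @ np.diag([1.0, 1.0, sign]) @ u.T
    p_rotated = p @ rotation.T
    return float(np.sqrt(np.mean(np.sum((p_rotated - q) ** 2, axis=1))))


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    points = rng.normal(size=(12, 3))              # synthetic coordinates
    shifted = points + np.array([5.0, -2.0, 1.0])  # pure translation
    print(f"r.m.s.d. after superposition: {kabsch_rmsd(points, shifted):.6f}")
    print(f"distance: {ca_distance([0, 0, 0], [3, 4, 0]):.1f}")
```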


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


The cryo-EM density maps and corresponding coordinates have been 
deposited in the Electron Microscopy Data Bank (EMDB) and the 
PDB, respectively, under the following accession codes: EMD-20822 
and 6UO8 (SKF97541- and GS39783-bound GABAB), EMD-20823 and 
6UO9 (SKF97541-bound GABAB, int-2 state), EMD-20824 and 6UOA 
(SKF97541-bound GABAB, int-1 state), and EMD-21219 and 6VJM 
(apo GABAB). 

Source data are provided with this paper. Supplementary Figure 1 con- 
tains the uncropped gel shown in Extended Data Fig. 1f. All other data 
are available from the authors on reasonable request. 
Extended Data Fig. 1 | Expression, characterization and purification of 
GABAB. a, b, Cell surface expression of Halo-GB1 (a) and SNAP-GB2 (b), 
transfected alone or cotransfected with the second subunit, measured by the 
fluorescence emission of the Lumi4-Tb bound to the Halo (a) or SNAP tag (b). 
Values are normalized by the wild-type GB1 cotransfected with wild-type GB2 
(purple bar) and shown as mean ± s.d. of three biologically independent 
experiments. GABAB constructs for cryo-EM are expressed and function 
similarly to the wild-type receptor. c, d, Positive allosteric effect of GS39783 
(5 µM) on IP1 accumulation in cells expressing wild-type or cryo-EM constructs 
of GABAB heterodimers and activated either by GABA (c) or SKF97541 (d). Data 
are normalized by the wild-type response in absence of GS39783 (control) and 
shown as mean ± s.e.m. of three biologically independent experiments (4 for 
wild type). e, Representative SEC profile of apo GABAB in digitonin micelles. 
Dimeric fractions were pooled, supplemented with ligand and concentrated 
for cryo-EM imaging. f, Coomassie-stained SDS-PAGE profile shows two 
distinct bands for GB1 (86 kDa) and GB2 (88 kDa). For gel source data, see 
Supplementary Fig. 1. 


Extended Data Fig. 6 | Molecular dynamics simulations of inactive (apo) 
and active (agonist- and PAM-bound) states. a, VFT fluctuations in the 
inactive state are illustrated by comparing molecular dynamics snapshots at 
500 ns for all 5 trajectories (cartoon, purple for α-helices and yellow for 
β-strands) against the cryo-EM structure (grey surface). Cyan spheres indicate 
nitrogen of hydrophilic heads of the POPC lipid bilayer. Asterisk corresponds to 
trajectory 1. b, Traces of Cα r.m.s.d. for different domains in trajectory 1. The 
structures in the trajectory were aligned by TMDs against the cryo-EM 
structure. c, Distribution of distances for ionic interactions between TM3 and 
TM5 at the intracellular side of the apo GABAB, as observed in molecular 
dynamics simulations. d, e, Membrane-coupled ANM analysis of the inactive 
(d)- and active (e)-state cryo-EM structures. Eigenvectors of the slowest mode 
are drawn on every eighth Cα as magenta arrows. In the inset heat map, overall 
dynamics of the protein as observed in molecular dynamics simulations are 
compared against that predicted from ANM analysis on electron microscopy 
structures; correlations between eigenvectors of the ten slowest ANM modes 
and the principal components of Cα motions in molecular dynamics 
simulations are calculated, showing concurrence between the two. f, Distance 
plots as observed in agonist- and PAM-bound trajectories for interactions 
between SKF97541 (SKF) and various contact residues in the GB1 pocket (left 
column), interactions between GS39783 (PAM) and its contact residues at the 
TMD interface (right column, rows 1-3) and interactions between GB1 and GB2 
residues (right column, rows 4 and 5). g, h, Traces of Cα r.m.s.d. on the overall 
structure for individual molecular dynamics trajectories (green) and plots of 
root mean square fluctuations (r.m.s.f.) per residue in GB1 (blue) and GB2 
(yellow) subunits as observed in molecular dynamics simulations for inactive 
(g) and active (h) states. i, j, Stacked snapshots of GS39783 (i) and SKF97541 (j) 
binding site for one of the trajectories, taken every 50 ns and spanning 500 ns. 
In traces of Cα r.m.s.d. (b, g, h), data (solid line) are presented as an average 
within a sliding window of 500 ps; shading refers to 95% confidence interval 
(n = 500). In plots of r.m.s.f. (g, h), data (solid line) are shown as an average of 
5 independent trajectories at each residue position; shading refers to 95% 
confidence interval (n = 5). 
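
The smoothing described in this caption (a sliding-window average with a 95% confidence band) can be sketched as follows. This is an illustrative reconstruction, not the authors' script: it assumes one frame per picosecond (suggested by n = 500 for a 500-ps window, but not stated) and a normal-approximation interval of 1.96 standard errors, which ignores the correlation between neighbouring frames.

```python
import numpy as np


def sliding_mean_ci(values, window, z=1.96):
    """Sliding-window mean with a normal-approximation confidence band.

    Returns (mean, lower, upper), each of length len(values) - window + 1.
    """
    x = np.asarray(values, dtype=float)
    if window < 2 or window > x.size:
        raise ValueError("window must be between 2 and len(values)")
    windows = np.lib.stride_tricks.sliding_window_view(x, window)
    mean = windows.mean(axis=1)
    sem = windows.std(axis=1, ddof=1) / np.sqrt(window)
    return mean, mean - z * sem, mean + z * sem


if __name__ == "__main__":
    rng = np.random.default_rng(2)
    # Synthetic r.m.s.d. trace sampled every 1 ps (illustration only)
    trace = 2.0 + 0.3 * rng.normal(size=5000)
    mean, lower, upper = sliding_mean_ci(trace, window=500)
    print(mean.shape, float(mean[0]), float(upper[0] - lower[0]))
```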
Extended Data Fig. 7 | Comparisons with previous crystal structures 
and additional details of activation-related transitions in GABAB. 
a, Comparison of the VFTs between the apo cryo-EM structure (teal) and the 
apo crystal structure (PDB 4MQE, grey). b, Comparison of the VFTs between 
the active agonist- and PAM-bound cryo-EM structure (orange) and the 
GABA-bound crystal structure (PDB 4MS3, grey). c, Comparison between GB1 
and GB2 TMDs in the inactive apo state demonstrates no substantial difference 
between the subunits. d, Comparison between GB1 and GB2 TMDs in the active 
agonist- and PAM-bound state illustrates the differences between the subunits 
upon activation. e, The TMD of GB1 remains unchanged between the int-2 and 
the active agonist- and PAM-bound states. f, Shifts of TM3 and TM5 at the 
intracellular side of the TMD of GB2 between the int-2 and active states open up 
a cleft for the engagement of a G protein. 
Extended Data Fig. 8 | Effects of a PAM on the orthosteric ligand binding in 
the site-1 and -2 mutants of GABAB. a, Displacement of CGP54626-DY647 
by GABA in intact cells expressing the wild-type GABAB receptor in the absence 
(blue) or presence of 5 µM (red), 10 µM (orange) or 20 µM (sand) GS39783. 
b, Cell-surface expression of site-1 and -2 mutants, measured by cotransfecting 
Halo-GB1 with GB2 and recording fluorescence emission of Halo-Lumi4-Tb. 
Data are normalized by the wild-type expression and shown as mean ± s.d. The 
numbers of biologically independent experiments are shown in parentheses. 
c, pKi values for GABA were determined from displacement of 
CGP54626-DY647 binding in intact cells expressing the indicated subunit combinations in 
the absence or presence of 1 µM or 5 µM GS39783. Values are mean ± s.e.m. of 
4 (10 for wild type) biologically independent experiments. Data are analysed 
using one-way ANOVA with Dunnett's multiple comparison test to determine 
significance (compared with no GS39783 for the same combined subunits), 
with ****P < 0.0001, ***P < 0.001, *P < 0.01 and NS (not significant) P > 0.05. For 
1 µM (5 µM) GS39783, P = 0.0030 (0.0045) for GB2(R556A), 0.0205 (0.0157) 
for GB2(H647A), 0.0084 (0.0009) for GB2(L657A), 0.0436 (0.0003) for 
GB2(F711A), 0.0013 (0.0053) for GB2(R714A), 0.0373 (0.0056) for GB2(Q720A), 
and 0.0735 (0.0019) for GB2(SFR-AAA). GB2(SFR-AAA) refers to a triple mutant 
(S710A/F711A/R714A) of GB2. d, e, Displacement of CGP54626-DY647 by 
GABA in intact cells expressing the indicated mutants in site 1 (d) and site 2 (e), 
in the absence or presence of the indicated concentrations of GS39783 as in a. 
Data are normalized by the signal in absence of GS39783 and shown as 
mean ± s.e.m. of three biologically independent experiments. 
Extended Data Fig. 9 | The effects of a PAM on signalling of the site-1 and -2 
mutants of GABAB. a, IP1 production induced by 10 µM GS39783 in intact cells 
expressing the indicated site-1 GB2 mutants co-expressed with the wild-type 
GB1. Data are normalized by the GABA response and shown as mean ± s.d. of 7 
(10 for wild type) biologically independent experiments. Data are analysed 
using one-way ANOVA with Dunnett's multiple comparison test to determine 
significance (compared with wild type), with P = 0.9998 for GB2(R556A), 
0.9999 for GB2(H647A), 0.9907 for GB2(L657A), 0.9995 for GB1(F711A), 0.9997 
for GB2(R714A), 0.9996 for GB2(Q720A), and 0.2807 for GB2(SFR-AAA). b-d, IP1 
production induced by GABA (blue) or GS39783 (red) in intact cells expressing 
the wild-type receptor (b) or the indicated mutant of site 1 (c) or site 2 (d). 
Data are normalized by the GABA response and shown as mean ± s.e.m. of four 
biologically independent experiments. 




Extended Data Table 1 | Cryo-EM data collection, refinement and validation statistics 

                                    GABAB apo       GABAB/SKF97541  GABAB/SKF97541  GABAB/SKF97541/
                                                    Int-1           Int-2           GS39783
                                    (EMDB-21219)    (EMDB-20824)    (EMDB-20823)    (EMDB-20822)
                                    (PDB 6VJM)      (PDB 6UOA)      (PDB 6UO9)      (PDB 6UO8)

Data collection and processing
  Magnification                     135,000         29,000          29,000          29,000
  Voltage (kV)                      300             300             300             300
  Electron exposure (e-/Å²)         8               15              15              15
  Defocus range (µm)                -1.5 to -2.5    -1.5 to -3.0    -1.5 to -3.0    -1.5 to -3.0
  Pixel size (Å)                    1.08            0.8521          0.8521          0.8521
  Symmetry imposed                  C1              C1              C1              C1
  Initial particle images (no.)     2,010,390       5,641,714       5,641,714       4,173,286
  Final particle images (no.)       113,093         48,600          114,552         89,001
  Map resolution (Å)                4.0             6.3             4.8             3.6
    FSC threshold                   0.143           0.143           0.143           0.143
  Map resolution range (Å)          3.7–4.6         5.7–12.2        4.5–7.3         3.4–7.0
Refinement
  Initial model used (PDB code)     4MQE, 6UO8      4MQE, 6VJM      4MS3, 6UO8      4MS3, 4OR2
  Model resolution (Å)              4.2             7.6             5.2             3.8
    FSC threshold                   0.5             0.5             0.5             0.5
  Model resolution range (Å)        29.1–3.4        29.3–1.8        34.3–3.5        33.7–3.0
  Map sharpening B factor (Å²)      -143.0          -378.1          -230.7          -125.8
  Model composition
    Non-hydrogen atoms              10,629          9,153           9,915           10,958
    Protein residues                1,369           1,323           1,365           1,385
    Ligands                         0               0               1               2
    N-glycans                       11              11              13              13
  B factors (Å²)
    Protein                         147.4           361.4           209.8           116.0
    Ligands                         N/A             N/A             141.0           120.8
    N-glycans                       191.8           421.3           260.1           145.3
  R.m.s. deviations
    Bond lengths (Å)                0.005           0.006           0.006           0.004
    Bond angles (°)                 0.912           1.300           1.360           0.713
Validation
  MolProbity score                  1.86            1.87            1.86            1.68
  Clashscore                        6.77            6.80            6.36            4.16
  Poor rotamers (%)                 0.19            1.14            1.07            0.00
  Ramachandran plot
    Favored (%)                     91.75           92.94           92.03           92.23
    Allowed (%)                     8.25            7.06            7.97            7.77
    Disallowed (%)                  0.0             0.0             0.0             0.0
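
The table quotes map resolutions at an FSC threshold of 0.143 and model resolutions at a threshold of 0.5. As a rough illustration of what such a threshold means, the Python sketch below finds the spatial frequency at which a Fourier shell correlation curve first drops below a chosen cutoff and reports its reciprocal as the resolution. The curve values are hypothetical, the linear interpolation is an assumption of this sketch, and the function is not part of RELION, cryoSPARC or Phenix.

```python
def resolution_at_threshold(spatial_freqs, fsc_values, threshold=0.143):
    """Resolution (Å) where the FSC curve first falls below `threshold`.

    spatial_freqs: increasing spatial frequencies in 1/Å.
    fsc_values:    FSC value at each frequency.
    Returns None if the curve never crosses the threshold.
    """
    points = list(zip(spatial_freqs, fsc_values))
    for (f0, c0), (f1, c1) in zip(points, points[1:]):
        if c0 >= threshold > c1:
            # Linear interpolation between the two bracketing shells.
            f_cross = f0 + (c0 - threshold) * (f1 - f0) / (c0 - c1)
            return 1.0 / f_cross
    return None


# Hypothetical FSC curve, for illustration only.
freqs = [0.05, 0.10, 0.20, 0.25, 0.30]     # 1/Å
fsc = [1.00, 0.95, 0.60, 0.243, 0.043]

res_0143 = resolution_at_threshold(freqs, fsc, threshold=0.143)
res_05 = resolution_at_threshold(freqs, fsc, threshold=0.5)
print(f"FSC = 0.143: {res_0143:.2f} Å; FSC = 0.5: {res_05:.2f} Å")
```

On any monotonically decaying curve the stricter 0.5 cutoff is crossed at a lower spatial frequency, so it yields a numerically larger (poorer) resolution than the 0.143 cutoff.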


Extended Data Table 2 | Mutation study of GABAB 

Construct       GABA                    GABA              GS39783           GS39783
                pKi ± S.D. [GS] (n)     pEC50 ± S.D. (n)  pEC50 ± S.D. (n)  Emax (%) ± S.D. (n)

WT + WT         5.44 ± 0.06 [0] (10)    6.19 ± 0.08 (4)   5.03 ± 0.08 (4)   91.9 ± 4.8 (4)
                5.65 ± 0.08 [1] (10)
                5.92 ± 0.12 [5] (10)
                5.95 ± 0.16 [10] (7)
                6.02 ± 0.15 [20] (7)

Site 1
WT + R556A      5.48 ± 0.07 [0] (4)     6.16 ± 0.07 (4)   4.93 ± 0.14 (4)   98.5 ± 8.7 (4)
                5.80 ± 0.05 [1] (4)
                6.13 ± 0.08 [5] (4)
                6.16 ± 0.25 [10] (4)
                6.18 ± 0.24 [20] (4)
WT + H647A      5.58 ± 0.13 [0] (4)     6.26 ± 0.09 (4)   5.00 ± 0.11 (4)   90.8 ± 6.6 (4)
                5.79 ± 0.09 [1] (4)
                6.12 ± 0.08 [5] (4)
                6.06 ± 0.14 [10] (4)
                6.15 ± 0.17 [20] (4)
WT + L657A      5.59 ± 0.11 [0] (4)     6.2 ± 0.13 (4)    5.04 ± 0.14 (4)   99.5 ± 8.6 (4)
                5.88 ± 0.07 [1] (4)
                6.16 ± 0.06 [5] (4)
                6.10 ± 0.16 [10] (4)
                6.35 ± 0.14 [20] (4)
WT + F711A      5.65 ± 0.16 [0] (4)     6.3 ± 0.1 (4)     5.14 ± 0.11 (4)   90.7 ± 6.4 (4)
                5.84 ± 0.11 [1] (4)
                6.13 ± 0.14 [5] (4)
                5.88 ± 0.26 [10] (4)
                5.86 ± 0.23 [20] (4)
WT + R714A      5.46 ± 0.05 [0] (4)     6.15 ± 0.06 (4)   5.08 ± 0.1 (4)    88.2 ± 5.5 (4)
                5.63 ± 0.06 [1] (4)
                5.93 ± 0.14 [5] (4)
                5.92 ± 0.16 [10] (3)
                5.65 ± 0.20 [20] (3)
WT + Q720A      5.48 ± 0.16 [0] (4)     6.37 ± 0.07 (4)   5.35 ± 0.11 (4)   80.1 ± 4.7 (4)
                5.70 ± 0.22 [1] (4)
                5.96 ± 0.11 [5] (4)
                5.82 ± 0.16 [10] (3)
                6.03 ± 0.18 [20] (3)
WT + SFR-AAA    5.75 ± 0.15 [0] (4)     6.34 ± 0.06 (4)   5.39 ± 0.12 (4)   93.7 ± 5.9 (4)
                5.93 ± 0.08 [1] (4)
                6.16 ± 0.10 [5] (4)
                5.98 ± 0.18 [10] (3)
                5.98 ± 0.19 [20] (3)

Site 2
WT + Y697A      5.43 ± 0.04 [0] (4)     5.56 ± 0.09 (3)   N.D.              N.D.
                5.43 ± 0.10 [1] (4)
                5.45 ± 0.05 [5] (4)
WT + N698A      5.48 ± 0.07 [0] (4)     5.96 ± 0.06 (3)   N.E.              N.E.
                5.48 ± 0.05 [1] (4)
                5.46 ± 0.07 [5] (4)
                5.30 ± 0.16 [10] (3)
                5.18 ± 0.16 [20] (3)
WT + MYN-AAA*   5.44 ± 0.05 [0] (4)     5.25 ± 0.16 (3)   N.E.              N.E.
                5.44 ± 0.05 [1] (4)
                5.56 ± 0.18 [5] (4)
M807A + WT      5.51 ± 0.06 [0] (3)     6.14 ± 0.06 (3)   4.72 ± 0.12 (3)   73.5 ± 6.6 (3)
                5.60 ± 0.09 [1] (3)
                5.74 ± 0.15 [5] (3)
Y810A + WT      5.48 ± 0.09 [0] (3)     5.59 ± 0.04 (3)   N.E.              N.E.
                5.49 ± 0.03 [1] (3)
                5.46 ± 0.09 [5] (3)
N811A + WT      5.45 ± 0.12 [0] (3)     5.74 ± 0.03 (3)   N.E.              N.E.
                5.43 ± 0.07 [1] (3)
                5.48 ± 0.16 [5] (3)
MYN-AAA + WT    5.47 ± 0.10 [0] (3)     5.12 ± 0.03 (3)   N.E.              N.E.
                5.44 ± 0.09 [1] (3)
                5.45 ± 0.11 [5] (3)

Apparent affinities for GABA (pKi) in the presence of indicated concentrations of GS39783 (0, 1, 5, 10 or 20 µM) and functional data for GABA or GS39783 alone. N.E., no effect (absence of 
response); N.D., not determined (value cannot be calculated). The number of biologically independent experiments (n) is shown in parentheses. 


*Mutation MYN-AAA in GB2 refers to a triple mutant (M694A/Y697A/N698A). 
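
The table reports affinities and potencies on a negative log10 scale (pKi, pEC50). As a small reading aid, the Python sketch below converts such values to molar concentrations and expresses a difference between two p-values as a fold change. The helper names are illustrative assumptions; the two wild-type pKi values are the ones listed in the table for 0 and 20 µM GS39783.

```python
def p_to_molar(p_value):
    """Convert a pKi or pEC50 (negative log10 of molar) to a molar concentration."""
    return 10.0 ** (-p_value)


def fold_shift(p_reference, p_shifted):
    """Fold increase in apparent affinity when the p-value rises from reference to shifted."""
    return 10.0 ** (p_shifted - p_reference)


# Wild-type pKi for GABA without and with 20 µM GS39783 (values from the table).
wt_pki_0 = 5.44
wt_pki_20 = 6.02

ki_0_um = p_to_molar(wt_pki_0) * 1e6     # apparent Ki in µM without the PAM
ki_20_um = p_to_molar(wt_pki_20) * 1e6   # apparent Ki in µM with the PAM
shift = fold_shift(wt_pki_0, wt_pki_20)

print(f"Ki: {ki_0_um:.2f} µM -> {ki_20_um:.2f} µM ({shift:.1f}-fold)")
```

Because the scale is logarithmic, a difference of 0.3 in pKi corresponds to roughly a twofold change in affinity, and a difference of 1.0 to a tenfold change.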
For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 
Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection Cryo-EM data were collected automatically on a Titan Krios (FEI) using SerialEM v.3.8beta. 


Data analysis The following software was used in this study: MotionCor2, CTFFIND v. 4.1.10, RELION v. 3.0, cryoSPARC v. 2.11, UCSF ChimeraX v. 0.7.0, 
Coot v. 0.8.9, PyMOL v. 2.3.3, Phenix v. 1.17.1-3660, CCP4 v. 7.0.078, ICM-Pro v. 3.8.7b, ProDy v. 1.10.10, MDAnalysis v. 0.20.1, GROMACS v. 
2018, GraphPad Prism v. 6, SWISS-MODEL web server (https://swissmodel.expasy.org), CHARMM-GUI web server (http://www.charmm-gui.org/). 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability 


The cryo-EM density maps and corresponding coordinates have been deposited in the Electron Microscopy Data Bank (EMDB) and the Protein Data Bank (PDB), 
respectively, under the following accession codes: EMD-20822 and 6UO8 (GABAB bound to SKF97541 and GS39783), EMD-20823 and 6UO9 (GABAB bound to 
SKF97541, int-2 state), EMD-20824 and 6UOA (GABAB bound to SKF97541, int-1 state), and EMD-21219 and 6VJM (GABAB in the apo state). 

Figs. 2e, f, 3e, f and Extended Data Figs. 1a-d, 8, 9 have associated raw data. Supplementary information file contains the uncropped gel shown in Extended Data 
Fig. 1f. All other data are available from the authors on reasonable request. 
Sample size No statistical methods were used to predetermine sample size. Pharmacological assays were conducted in at least n = 3 biologically 
independent experiments, comparable to other published studies. Wild-type constructs were used as internal controls, resulting in 
a larger number of independent repeats. Sample size for cryo-EM experiments was determined by the availability of microscope time and by the 
need to obtain a high-resolution reconstruction within that limited time. 
Data exclusions No pharmacological data were excluded. There were no pre-established criteria for exclusion of cryo-EM data. However, the process of 
generating 3D maps from cryo-EM particles involves sorting out particles that are damaged, have low signal, or are in minority conformations 
that are unlikely to refine correctly. This is implemented in RELION 3.0 and cryoSPARC 2.11. 


Replication All pharmacological measurements were performed at least in triplicate. All attempts at replication were successful and included in the final data. 
There was no attempt to replicate cryo-EM data. Each dataset includes averaging of tens of thousands of particles. 


Randomization This study did not allocate experimental groups; thus, no randomization was required for the reported experiments. 


Blinding The researchers were not blinded to allocation during experiments and outcome assessment. Blinding was not required for the reported 
experiments because all functional and structural data were analyzed using the same methods, and results are not subjective. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 

n/a | Involved in the study n/a | Involved in the study 
Antibodies ChIP-seq 
Eukaryotic cell lines Flow cytometry 
Palaeontology MRI-based neuroimaging 


Animals and other organisms 


Human research participants 


Clinical data 


Antibodies 


Antibodies used Anti-FLAG M2 antibody gel. Supplier MilliporeSigma, Catalog number A2220, M2 monoclonal, Lot number SLCD7154, no 
dilution. 
Anti-GFP nanobody resin, cAbGFP4, no dilution, made in-house using protocols from Kubala et al. "Structural and 
thermodynamic analysis of the GFP: GFP-nanobody complex." Protein Science 19.12 (2010): 2389-2401. 


Validation Anti-FLAG M2 antibody gel, full validation report is available on the supplier website. 
Anti-GFP nanobody, validation was reported by Kubala et al. in "Structural and thermodynamic analysis of the GFP: GFP- 
nanobody complex." Protein Science 19.12 (2010): 2389-2401. 


Eukaryotic cell lines 
Cell line source(s) Cell lines were purchased from the American Type Culture Collection (ATCC). Insect Sf9 cell line: ATCC CRL-1711. Human 
HEK293 cell line: ATCC CRL-1573. 


Authentication The cell line was authenticated by the supplier (ATCC) using morphology and growth characteristics. 


Mycoplasma contamination Cells have been tested and shown to be free from mycoplasma (Hoechst DNA stain and Direct Culture methods employed). 


Commonly misidentified lines No commonly misidentified cell lines were used. 
(See ICLAC register) 
The human GABAB receptor, a member of the class C family of G-protein-coupled 
receptors (GPCRs), mediates inhibitory neurotransmission and has been implicated 
in epilepsy, pain and addiction. A unique GPCR that is known to require 
heterodimerization for function, the GABAB receptor has two subunits, GABAB1 and 
GABAB2, that are structurally homologous but perform distinct and complementary 
functions. GABAB1 recognizes orthosteric ligands, while GABAB2 couples with G 
proteins. Each subunit is characterized by an extracellular Venus flytrap (VFT) 
module, a descending peptide linker, a seven-helix transmembrane domain and a 
cytoplasmic tail. Although the VFT heterodimer structure has been resolved, the 
structure of the full-length receptor and its transmembrane signalling mechanism 
remain unknown. Here we present a near full-length structure of the GABAB receptor, 
captured in an inactive state by cryo-electron microscopy. Our structure reveals 
several ligands that preassociate with the receptor, including two large endogenous 
phospholipids that are embedded within the transmembrane domains to maintain 
receptor integrity and modulate receptor function. We also identify a previously 
unknown heterodimer interface between transmembrane helices 3 and 5 of both 
subunits, which serves as a signature of the inactive conformation. A unique 
‘intersubunit latch’ within this transmembrane interface maintains the inactive state, 
and its disruption leads to constitutive receptor activity. 


We assembled the heterodimeric human GABAB receptor by using 
baculovirus-infected mammalian cells. Each subunit was truncated 
at the carboxy-terminal end according to the domain boundary of 
the intracellular coiled-coil, in order to eliminate flexible regions 
(Supplementary Fig. 1). Upon extraction and purification with detergent 
(Extended Data Fig. 1a, b and Supplementary Fig. 2), the complex 
bound to radioactive [³H]GABA with a dissociation constant comparable 
to the GABA affinity reported for native receptors (Extended 
Data Fig. 1c). Functional analysis incorporating a chimaeric Gαqi protein 
and inositol phosphate assay in mammalian cells showed that the 
agonist baclofen activated the C-terminally truncated and full-length 
receptor with similar potency and efficacy, indicating that the deleted 
regions in the GABAB1 and GABAB2 cytoplasmic tails are not required 
for ligand-mediated G-protein activation (Extended Data Fig. 1d). 
We determined the structure of the heterodimeric GABAB receptor 
by cryo-electron microscopy (cryo-EM) to an overall resolution of 
3.3 Å (Extended Data Fig. 2a–e and Supplementary Table 1). The global 
density map displayed directional anisotropy owing to linker flexibility 
(Extended Data Fig. 2f). Performing local refinement separately on the 
extracellular domains (ECDs) and transmembrane domains yielded 
reconstructions at resolutions of 3.1 Å and 3.4 Å, respectively (Extended 
Data Fig. 2g, h). We used a composite map that combined the ECD and 
transmembrane reconstructions for model building and refinement 
(Extended Data Fig. 3). By applying three-dimensional variability analysis 
to the data, we found that the receptor is in dynamic motion, and 
that its functional state corresponds to a continuum of conformations 
along multiple dimensions (Supplementary Videos 1–4). 

Fig. 1 | Cryo-EM structure of human GABAB receptor. a, Ribbon 
representation of the GABAB-receptor structure, composed of GABAB1 (blue) 
and GABAB2 (green) subunits. Ca²⁺ is shown as a green sphere; phospholipids 
(PE 38:5 and PC 38:2) are shown as yellow space-filling models; N-linked glycans 
(NAGs) and cholesterols (CLRs) are shown, respectively, as grey and pink 
ball-and-stick models. Transmembrane helices 1–7 are marked for each subunit. 
b, Cryo-EM density map of the GABAB receptor, composed of local 
reconstructions for extracellular (3.1 Å) and transmembrane (3.4 Å) domains, in 
an orthogonal view from a. c, Linker domains of GABAB1 and GABAB2, showing 
the main-chain and side-chain hydrogen-bonding patterns between the linker 
region and ECL2. 

The heterodimeric GABAB receptor is assembled when GABAB1 
and GABAB2 subunits interact side-by-side while facing in opposite 
directions (Fig. 1a, b and Extended Data Fig. 4). Both the VFT and 
the transmembrane components of the two subunits are related by 
pseudo-twofold axes. Extracellular and intracellular loops (ECLs and 
ICLs) that interconnect adjacent helices within each transmembrane 
domain are visible in the density map, except for ICL2. The cytoplasmic 
tail including the coiled-coil domain is disordered, possibly because of 
its flexible attachment to the transmembrane domain. 

The elongated peptide linker that joins the VFT to the transmembrane 
domain is buttressed through its interaction with a β-hairpin 
formed by ECL2 (Fig. 1c). ECL2 twists across the linker, forming a united 
mechanical junction that transmits the conformational changes in 
the VFT to the transmembrane domain and vice versa. Additionally, 
the ECD and transmembrane domains of the receptor spontaneously 
flex back and forth about the linker, exhibiting the region’s intrinsic 
flexibility (Supplementary Video 1). 


Given that no ligand was added during protein purification, we 
expected the receptor to be in an apo form and an inactive conformation. 
To our surprise, we observed several ligands bound to the receptor. 
GABAB1 contains a Ca²⁺ ion at the interdomain cleft of VFT. In addition, 
an endogenous phospholipid is bound within the transmembrane 
pocket of each subunit, with a phosphatidylethanolamine (PE 38:5) 
in GABAB1, and a phosphatidylcholine (PC 38:2) in GABAB2. Finally, 
ten cholesterol or cholesteryl hemisuccinate molecules, which we 
modelled as cholesterol (see Methods), are distributed around the 
exterior of the transmembrane complex, including two that interface 
with both subunits. 


Inactive conformation of GABAB receptor 


The cryo-EM structure of the GABAB receptor appears to occupy an 
inactive conformation, given its similarity to known crystal structures 
of the GABAB VFT in its apo and antagonist-bound states. First, the VFT 
module, composed of the lobe-shaped LB1 and LB2 domains, adopts 
an open interdomain conformation in both subunits (Extended Data 
Fig. 5a, b). Second, although an amino-terminal LB1–LB1 dimer interface 
is present in all functional states, the distinct lack of a heterodimer 
interface between the membrane-proximal LB2 domains is shared by 
the near full-length cryo-EM structure and inactive-state VFT structures 
(Extended Data Fig. 5c, d). By contrast, a hallmark of the active-state 
VFT structures is a unique heterodimer interface between LB2 domains, 
which results from agonist-induced GABAB1 closure. 

Using conformational variability analysis, we observed that the LB2 
domains of both subunits fluctuate by approaching and withdrawing 
from the central vertical axis of the heterodimer, yet never make contact 
as they do in active-state VFT crystal structures (Supplementary 
Videos 2, 3). The motion exhibited by the LB2 domains and their associated 
linkers suggests that the inactive functional state corresponds 
to an ensemble of conformations, where the separation between the 
membrane-proximal regions falls within a small range around the 
coordinates of our structure. 


Inactive transmembrane heterodimer interface 


We identified a hitherto unknown heterodimer interface between the 
transmembrane TM5 and TM3 helices of both subunits that embodies 
the signature of the GABAB receptor’s inactive conformation (Fig. 2a). 
A TM5–TM5 contact has previously been detected through crosslinking 
of the GABAB receptor, but transmembrane dimer interfaces of any 
kind are yet to be found in other inactive class C GPCRs, including the 
recent inactive metabotropic glutamate receptor mGlu5 structure 
(Extended Data Fig. 5e). 

Positioned at 30° from the extracellular dimer interface, the transmembrane 
dimer interactions bury approximately 740 Å² of surface 
area and exhibit high shape complementarity (Extended Data Fig. 5f). 
The pair of TM5 helices scissor at their central residues before contacting 
the transverse TM3 of opposing subunits at their intracellular 
ends (Fig. 2a). TM5 extracellular ends display conformational variance 
wherein they approach and withdraw but fail to make contact 
(Supplementary Video 4). All direct heterodimer interactions occur near 
the cytoplasmic membrane surface and can be divided into three core 
layers (I–III) along the helical path of TM5 (Fig. 2a). 

The surface layer I caps the extracellular end of the transmembrane 
heterodimer interface. It is composed of hydrophobic contacts between 
four leucine residues of both subunits. The middle layer II lies directly 
beneath layer I, and consists of three phenylalanine residues packing 
against one another, as well as their neighbouring leucine residues. 
Both layers solely incorporate TM5 residues. 

Layer III, consisting of sections IIIa and IIIb, completes the interface at 
the intracellular end. IIIa possesses a network of salt bridges that tether 
the cytoplasmic ends of GABAB1 and GABAB2 transmembrane domains. 
This critical interaction is mediated by a quartet of charged residues 
from the TM3 and TM5 helices (His 572 and Glu 673 in GABAB1; 
His 579 and Glu 677 in GABAB2; Supplementary Table 2), a feature 
that we refer to as the ‘intersubunit latch’ for securing the transmembrane 
orientation of the two subunits in the inactive conformation. 
One of the ‘intersubunit latch’ residues shares nonpolar contacts with 
a lysine of GABAB TM5, establishing the accessory site IIIb. 

In addition, the central layers of direct heterodimer contacts are 
flanked on each side by a cholesterol molecule. One mediates the 
interaction between TM5 helices of both subunits (CLR3), while the 
other bridges TM3 of GABAB1 and TM5 of GABAB2 (CLR6) (Extended 
Data Fig. 5g). 


The ‘intersubunit latch’ 


To determine the importance of the ‘intersubunit latch’ in controlling 
the inactive state of the GABAB receptor, we examined the effect of 
single charge-repelling mutations (GABAB1 E673R and GABAB2 H579E) 
within the motif. Wild-type GABAB receptor exhibited basal activity in 
the absence of agonist (Fig. 2b), as reported previously. Both mutants 
substantially increased basal activity when compared to the wild type, 
suggesting that each mutation promotes agonist-independent constitutive 
activity (Fig. 2b). The basal activity displayed by these mutants 
reached approximately 80–90% of the maximal agonist-dependent 
wild-type response (Fig. 2b, c). Application of the agonist baclofen 
raised receptor activity further to the highest wild-type level (Fig. 2c). 
Neither mutation altered the agonist potency. After each construct 
was treated with the inverse agonist CGP54626, the basal activity of 
mutants still remained higher than that of the wild type, providing 
further evidence that the ‘intersubunit latch’ mutations serve to shift 
the conformational equilibrium of GABAB towards an active state. Taken 
together, our mutational data indicate that the ‘intersubunit latch’ is 
fundamental to maintaining the inactive state of the receptor. 

Fig. 2 | Transmembrane heterodimer interface of the GABAB receptor. 
a, The transmembrane heterodimer interface formed by TM5 and TM3 of both 
subunits. Three layers (I, II and III) of interfacial contacts are identified by 
dotted circles. Direct heterodimer contacts within each layer are shown on the 
right. b, c, Functional analysis of the ‘intersubunit latch’. Basal activity (b) and 
dose-dependent baclofen-stimulated receptor response (c) in cells transiently 
expressing a Gαqi chimaera (abbreviated as Gqi) with different combinations 
of GABAB wild-type subunits (WT) and mutant subunits (GABAB1 E673R, 
abbreviated as E673R; GABAB2 H579E, abbreviated as H579E). Inositol-1-phosphate 
(IP1) accumulation was measured in the presence or absence of 
20 µM CGP54626 (abbreviated as INV). Cells transfected with empty pcDNA3.1 
vector (abbreviated as pcDNA), Gαqi alone, or WT GABAB subunits in the 
absence of Gαqi were used as controls. Relative activity is expressed as 
percentage of maximum wild-type activity induced by baclofen relative to the 
activity of Gαqi alone. Data points represent averages ± s.e.m. of multiple 
experiments (n values shown), each with quadruplicate measurements. 
****P < 0.0001; one-way analysis of variance (ANOVA) with Bonferroni's post 
hoc test was used to calculate statistical differences in basal activity (b). The 
level of cell surface expression was 90% for E673R/WT and 76% for WT/H579E 
mutants in comparison with the WT/WT heterodimer. 
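
The Fig. 2 legend expresses relative activity as a percentage of the maximum baclofen-induced wild-type activity, relative to the activity of the Gqi chimaera alone. One plausible reading of that normalization, sketched in Python under the assumption that the Gqi-alone signal is subtracted as a baseline, is shown below. The raw readings and the function name are hypothetical and are not data from the study.

```python
def relative_activity(signal, gqi_alone, wt_max):
    """Percentage of the maximal wild-type response above the Gqi-alone baseline.

    Assumes the normalization is 100 * (signal - baseline) / (WT max - baseline).
    """
    span = wt_max - gqi_alone
    if span <= 0:
        raise ValueError("wild-type maximum must exceed the Gqi-alone baseline")
    return 100.0 * (signal - gqi_alone) / span


# Hypothetical raw IP1 readings in arbitrary units.
raw = {
    "Gqi alone": 200.0,
    "WT/WT basal": 500.0,
    "E673R/WT basal": 1000.0,
    "WT/WT + baclofen": 1200.0,
}

normalized = {
    name: relative_activity(value, raw["Gqi alone"], raw["WT/WT + baclofen"])
    for name, value in raw.items()
}
for name, percent in normalized.items():
    print(f"{name}: {percent:.0f}%")
```

Under this reading, the Gqi-alone control sits at 0% and the baclofen-stimulated wild type at 100% by construction, so a mutant basal activity of 80–90% can be read directly against the wild-type maximum.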


Endogenous ligands bound to GABAB VFT 


We identified a new potential Ca²⁺-binding site in the vicinity of the 
orthosteric ligand-binding cleft on the LB2 surface of GABAB1 (Fig. 3a 
and Extended Data Fig. 6a, b). The metal ion density has remarkable 
peak height in the cryo-EM density map (10σ) and is surrounded by 
residues that are chemically favourable for Ca²⁺ coordination. The Ca²⁺ 
is anchored by the carboxylate groups of three acidic residues (Asp 281, 
Glu 309 and Glu 423) and by the backbone carbonyl atoms of two additional 
residues (Gly 277 and Tyr 279) (Fig. 3b). The Ca²⁺–O bond distances 
are between 3.0 Å and 4.6 Å, suggesting that the ion is in a hydrated state. 
The Ca²⁺ location in GABAB1 is different from any of the four sites found 
in the calcium-sensing (CaS) receptor, another class C GPCR (Fig. 3c). 
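
The inference that the ion is hydrated rests on the Ca²⁺–O distances being longer than a direct contact, for which roughly 2.4 Å is a commonly quoted value. As a minimal sketch of that check, the Python below measures ion-to-oxygen distances and flags those beyond an assumed cutoff. The coordinates, the atom labels and the 2.6 Å cutoff are hypothetical choices for illustration and do not come from the deposited model.

```python
import math

# Assumed upper bound (Å) for a direct Ca-O contact; an illustrative choice.
DIRECT_CA_O_CUTOFF = 2.6


def classify_contacts(ion_xyz, oxygens, cutoff=DIRECT_CA_O_CUTOFF):
    """Map each oxygen label to (distance in Å, 'direct' or 'long')."""
    contacts = {}
    for label, xyz in oxygens.items():
        distance = math.dist(ion_xyz, xyz)
        contacts[label] = (distance, "direct" if distance <= cutoff else "long")
    return contacts


# Hypothetical coordinates in Å, placed only to illustrate the calculation.
ion = (0.0, 0.0, 0.0)
oxygens = {
    "Asp 281 OD1": (3.0, 0.0, 0.0),
    "Gly 277 O": (0.0, 4.0, 0.0),
    "Glu 309 OE1": (2.0, 2.0, 3.0),
}

contacts = classify_contacts(ion, oxygens)
for label, (distance, kind) in contacts.items():
    print(f"{label}: {distance:.2f} Å ({kind})")
```

With every distance falling in the "long" class, the coordinating oxygens are too far away for direct contact, which is consistent with water molecules bridging the ion and the protein.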

Using inductively coupled plasma mass spectroscopy, we detected 
the presence of Ca and Cu above background level in purified GABAB 
receptor. The amounts of other metal elements were negligible 
(Supplementary Table 3). The molar ratios of Ca and Cu relative to the receptor 
protein (0.43:1 and 0.51:1) suggest partial occupancy of the ion-binding 
sites. An unmodelled density within the interdomain cleft, coordinated 
by tryptophan and histidine, may serve as a potential Cu²⁺-binding site 
but lacks sufficient signal to be labelled confidently. 

In exploring the functional role of the bound ion, we found that the 
Ca²⁺ chelator EGTA substantially reduced GABAB basal activity, and that 
mutating specific coordinating residues (GABAB1 E309K and GABAB1 
E423R) had similar, although less drastic, effects (Extended Data Fig. 6c, 
d). Consistent with previous findings, these data suggest that Ca²⁺ may act 
as a positive allosteric modulator of the GABAB receptor. Our structure 
implies that Ca²⁺ stabilizes residues adjacent to the critical agonist-binding 
residue Trp 278, thereby indirectly reinforcing its conformation (Fig. 3b). 

Fig. 3 | Ca²⁺ binding in GABAB1. a, Ribbon representation of the GABAB1 
subunit, showing the location of the Ca²⁺-binding site at the interdomain 
crevice of VFT. b, Specific interactions between GABAB1 and Ca²⁺. The mesh 
represents the cryo-EM density map contoured at 7.5σ surrounding Ca²⁺. 
c, Crystal structure of the CaS receptor extracellular domain (Protein Data Bank 
(https://www.rcsb.org) code 5K5S), highlighting its bound Ca²⁺ (black spheres, 
numbered 1–4 and 1′–4′ in the two protomers) and the corresponding location 
of the Ca²⁺-binding site in GABAB1 (green sphere). 

We also found density in the orthosteric agonist-binding site 
of GABAB1, and its shape suggests a GABA-like endogenous ligand 
(Extended Data Fig. 6a, b). GABA is a potential candidate because it fits 
the density and was detected in the lysate of cells used to express the 
receptor (Extended Data Fig. 6e, f). This GABA-like endogenous ligand 
bound in an inactive receptor conformation may reflect a preactivation 
state; however, further investigation is required. 


Discovery of endogenous phospholipid ligands 
Our structure revealed endogenous phospholipids within the transmembrane 
domains of both GABAB subunits. Using the CaS receptor as 
a control in mass spectrometry, we identified two phospholipids specifically 
bound to the GABAB receptor, PE 38:5 and PC 38:2 (Fig. 4a–d). Both 
lipids consist of two long-chain fatty acyl moieties of 18 and 20 carbons. 
We further assigned PE 38:5 to GABAB1 and PC 38:2 to GABAB2 on the basis 
of the size difference between the phosphoethanolamine and phosphocholine 
head groups of the two lipids (Extended Data Fig. 7a, b). The 
lipid density in the GABAB2 transmembrane domain has a bulkier head 
group that can better accommodate the larger choline moiety of PC 38:2. 

Mirroring the amphipathicity of phospholipids, the lipid-binding 
pocket of each GABAB subunit retains a hydrophilic trunk and two lipophilic 
branches for binding the lipid polar head and nonpolar tails, 
respectively. The trunk consists of a negatively charged patch covering 
the amine moiety, and a positively charged area surrounding the 
phosphate (Extended Data Fig. 7c, d). The lipid-binding pockets are 
notably deep, extending from the extracellular membrane surface to 
the centre of the transmembrane domain (Fig. 4e, f). Each lipid occupies 
nearly the entire range of ligand-binding positions in class A, B, C and 
F GPCRs (Extended Data Fig. 7e–h). 


Each GABAB subunit makes extensive contacts with the bound lipid, 
using a majority of the transmembrane helices, including TM2, 3, 5, 6 
and 7 (Extended Data Fig. 7a, b, i, j). Approximately 2,400 Å² of surface 
area is buried by either lipid-subunit pair. In addition, ECL2 directly 
contacts PE 38:5 in GABAB1, while the linker and all three ECLs interact 
with PC 38:2 in GABAB2. 

Key elements of the lipid-receptor interactions are conserved in 
GABAB1 and GABAB2 (Fig. 4g–j, Extended Data Fig. 7i, j and Supplementary 
Fig. 3). First, the hydrophilic head of each lipid is anchored through 
interactions with conserved histidine and arginine residues (His 643 of 
ECL2 and Arg 549 in GABAB1; His 647 of ECL2 and Arg 556 in GABAB2) 
(Fig. 4g, h). GABAB2 also incorporates Arg 714 of ECL3, rendering the 
lipid-binding pocket more electropositive than that of GABAB1 (Fig. 4h). 

Second, the 20-carbon fatty acyl chain of both lipids follows a perpendicular 
turn to pass between two aromatic residues (Phe 557 and 
Tyr 657 in GABAB1; Tyr 564 and Tyr 661 in GABAB2) (Fig. 4i, j). 
A cis double bond in each fatty acyl chain forms π interactions with 
the aromatic side chains. The bend is further buttressed by extensive 
nonpolar contacts with the aliphatic part of a conserved lysine (GABAB1 
Lys 660; GABAB2 Lys 664) lying parallel to the chain. Finally, the 
18-carbon fatty acyl chain of both lipids is relatively straight, extending 
towards the cytoplasm in a binding pocket lined by small aliphatic 
residues on TM2, TM3 and TM7 (Fig. 4i, j). 

The GABAB receptor’s transmembrane domains are covered by the 
linker and ECLs, which form a lid over the lipid-binding pocket. Phospholipids 
may access the lipid-binding pocket of designated subunits 
laterally through gaps between TM5 and TM6 (Extended Data Fig. 8a–d). 
One of the fatty acyl tails of each phospholipid even protrudes 
through this opening. The lipid-binding sphingosine-1-phosphate 
(S1P1) receptor possesses a similar gap between TM1 and TM7 
(Extended Data Fig. 8e, f). The endogenous lipids of other GPCRs can 
be readily replaced, but the size and engagement of the endogenous 
lipids bound to the GABAB receptor suggest that they are essential for 
maintaining receptor integrity and stability. 

To explore the physiological relevance of the endogenous phospholipids, 
we mutated residues that hydrogen bonded with the 
phosphate head group. We identified R714A in GABAB2 ECL3, which 
displayed a small gain of function despite reduced cell surface expression 
(Extended Data Fig. 8g, h). This mutation is expected to enhance 
the movement of PC 38:2 within GABAB2 by eliminating a critical 
interaction with the lid. Our results suggest that PC 38:2 may act as 
a negative allosteric modulator of GABAB receptor by stabilizing the 
inactive conformation of GABAB2. 


Comparison of subunits with other GPCRs 


The GABA,;, and GABA,, subunits have highly similar VFT and 
transmembrane components but differ in their relative orientation 
(Extended Data Fig. 9a, b). Each subunit differs from all other class C 
GPCRs in possessing an extended peptide linker between the VFT and 
transmembrane domains instead of a cysteine-rich domain (Fig. 1c). 

GABA, subunits also display distinct helix positions in the seven-helix 
bundles among inactive GPCRs (Extended Data Fig. 9c-f). However, 
these differences are minor compared with the dramatic movement of 
TM6inclass A and B GPCRs when activated by G-protein coupling”*”®. 
This corroborates our conclusion that we have captured an inactive 
conformation of the GABA, receptor. 

The GABAB receptor presents unique variations of conserved transmembrane 
motifs (Extended Data Fig. 10a, b). In most class A GPCRs 
such as rhodopsin, the ‘ionic lock’ tethers TM3 and TM6 to stabilize the 
inactive state of an individual transmembrane domain (Extended Data 
Fig. 10c). The ‘ionic locks’ of both GABAB subunits consist of an aspartate 
from ICL3 and a lysine from TM3 (Extended Data Fig. 10a, b). 
Although only the Lys/Asp pair in GABA,,, are within hydrogen-bond 
distance, their backbone Ca-Ca separations (9.3-9.4 A) are comparable 
Fig. 4 | Identification of endogenous phospholipid ligands of the GABAB
receptor. a, c, Liquid chromatography with tandem mass spectrometry
(LC-MS/MS) analysis of phospholipids bound to the GABAB receptor (n = 1). The
LC traces show the abundance of phospholipids PC 38:2 (a) and PE 38:5 (c) in
the GABAB preparation relative to a CaS receptor control. b, d, High-resolution MS
spectra of the peaks in a, c (black) matched with standard spectra of PC 38:2 (b)
and PE 38:5 (d) in red. NL, neutral loss: a fragment in a neutral charge state
during collision-induced dissociation; SN1 and SN2, stereospecific numbering


to that of rhodopsin (8.7 Å) and inactive mGlu receptors (10.9-11.2 Å),
indicating that the ‘ionic lock’ is intact within both GABAB subunits
(Extended Data Fig. 10a-e).

The ‘ionic lock’ in each GABAB subunit resides in close proximity
to an FxPKxx sequence in TM7, which is the counterpart of the
NPxxY(x)5,6F motif in class A GPCRs. Specifically, the conserved
Lys’* participates in a network of hydrophilic contacts with the ‘ionic
lock’ through Asn*” and a serine in ICL1 (Extended Data Fig. 10a, b
and Supplementary Fig. 3). These interactions unite the ‘ionic lock’
and FxPKxx motif into a larger and integral system for maintaining
the inactive transmembrane conformation of individual GABAB
subunits.


Conclusion 

The combination of our previous VFT structures and present cryo-EM
data supports the occurrence of three essential events during
activation of the GABAB receptor: first, agonist-induced closure of
the GABAB1b VFT; second, association of membrane-proximal LB2
domains; and third, dissociation of the ‘intersubunit latch’ and
ensuing rearrangement of the transmembrane heterodimer interface.
This hypothesis is consistent with our finding that an inverse
agonist bound to the extracellular domain can inhibit the constitutive


308 | Nature | Vol584 | 13 August 2020 


of glycerophospholipids, marking the fatty acyl groups at respectively the first
and second carbons of the glycerol. e, f, Molecular surface of the GABAB1b
(e) and GABAB2 (f) transmembrane domains, showing the binding pockets for
PE 38:5 and PC 38:2, respectively. g-j, Three views of the specific interactions
between phospholipids and each GABAB subunit. Contacts are shown between
the phospholipid head group and residues from GABAB1b (g) and GABAB2 (h),
and between phospholipid fatty acyl chains and GABAB1b (i) and GABAB2 (j).


activity stemming from the spontaneous closure of the GABAB1b VFT,
but not that resulting from direct ‘downstream’ disruption of the
‘intersubunit latch’.

The GABAB-receptor structure also yields surprising findings
regarding its endogenous ligand composition. We suspect that the
phospholipids discovered well inside each transmembrane cavity are necessary
structural components, as they are prebound within each subunit and
their interactions with the receptor are extensive. These endogenous
lipids may be unique to the GABAB receptor, as the lipid-binding
residues are not conserved among class C GPCRs (Supplementary Fig. 3).
Preoccupation of the transmembrane pocket suggests that GABAB
allosteric modulators may bind to yet unknown sites, with the
heterodimer interface being a potential location. An active structure of the
GABAB receptor would confirm whether the phospholipids are integral
receptor components or functional modifiers.
Methods


No statistical methods were used to predetermine sample size. The 
experiments were not randomized and investigators were not blinded 
to allocation during experiments and outcome assessment. 


Protein expression and purification 

Human GABAB1b (UniProt (https://www.uniprot.org) code
Q9UBS5-2) and GABAB2 (UniProt code O75899) subunits were each
cloned into a modified pEG BacMam vector (ref. 30) for coexpression in
baculovirus-infected mammalian cells. GABAB1a and GABAB1b are two
major isoforms of GABAB1 and have identical pharmacological
profiles. Different C-terminal truncations of each receptor subunit were
tested for heterodimeric receptor assembly. The optimal GABAB1b
expression construct consisted of residues 1-802 (GABAB1b(1-802)),
while the GABAB2 construct included residues 1-819 (GABAB2(1-819)).
This allowed the heterodimeric construct to be transported to the cell
membrane, as it retained the intracellular coiled-coil region present
in the intracellular tail of each subunit (refs 31, 32). Signal peptides for
GABAB1b and GABAB2 occupied residues 1-29 and 1-41, respectively. A Flag tag
was engineered at the C terminus of each subunit to facilitate affinity
purification.

Human embryonic kidney (HEK) 293 GnTI cells (ref. 33) were grown in
suspension culture at 37 °C in 8% CO2 using 293 freestyle media (Life
Technology, Carlsbad, CA). The cells were coinfected with recombinant
baculoviruses carrying the GABAB1b(1-802) and GABAB2(1-819) genes
at 37 °C. To enhance the expression level, we added 10 mM sodium
butyrate 18 h post infection, and incubated the cells for a further 72 h
at 30 °C before harvest.

Cell membrane was isolated by differential centrifugation. The cells 
were lysed using an EmulsiFlex-C3 high pressure homogenizer (Avestin, 
Ottawa, Canada) in a buffer containing 50 mM HEPES, pH 7.5, 150 mM
NaCl, 10% glycerol and a cocktail of protease inhibitors (Roche, Basel, 
Switzerland). Cell debris was removed by centrifugation of the lysed 
cell suspension at 10,000 r.p.m. The cell membrane was then pelleted 
by ultracentrifugation at 45,000 r.p.m. 

The GABAB receptor was extracted from the cell membrane with
50 mM HEPES, pH 7.5, 150 mM NaCl, 10% glycerol, 1% lauryl maltose
neopentyl glycol (LMNG) (Anatrace, Maumee, OH) and 0.2%
cholesteryl hemisuccinate (CHS) (MilliporeSigma, St. Louis, MO) at 4 °C
overnight. After the insoluble matter was removed by centrifugation, the
supernatant was applied to an anti-Flag M2 antibody affinity column
(MilliporeSigma, St. Louis, MO). The column was washed stepwise with
decreasing concentrations of detergent, from 0.1% to 0.002% LMNG.
The heterodimeric GABAB1b(1-802)-GABAB2(1-819) complex was then
eluted with 50 mM HEPES, pH 7.5, 50 mM NaCl, 0.002% LMNG, 0.0004%
CHS and 0.2 mg ml⁻¹ Flag peptide.

The receptor was further purified by Mono Q (GE Healthcare,
Chicago, IL) ion-exchange chromatography using a linear salt gradient
from 50 mM to 1 M NaCl in 50 mM HEPES, pH 7.5, 0.002% LMNG and
0.0004% CHS. Finally, the assembled GABAB receptor was subjected to
Superose 6 (GE Healthcare, Chicago, IL) size-exclusion chromatography
in 50 mM HEPES, pH 7.5, 50 mM NaCl, 0.002% LMNG and 0.0004% CHS.

HEK 293 GnTI cells were purchased from and authenticated by the 
American Type Culture Collection (ATCC catalogue number CRL-3022). 
Cell morphology was examined for each passage of cells. The cells were 
certified by ATCC to be free of mycoplasma contamination, but they 
were not tested again during culturing. 


Cryo-EM specimen preparation and data acquisition 

Specimens were composed of vitrified GABAB protein samples
occupying UltraAuFoil R 0.6/1, 300-mesh holey Au/Au grids (Quantifoil Micro
Tools, Jena, Germany). The surfaces of the grids were rendered
hydrophilic by glow-discharging using H2 and O2 for 25 s at 10 W with a Solarus
950 plasma cleaner system (Gatan, Pleasanton, CA). For vitrification,
3 µl of purified GABAB receptor at a concentration of approximately
0.3 mg ml⁻¹ was applied to each grid, blotted for 4 s at a blot force of 3
inside a Vitrobot Mark IV (Thermo Fisher Scientific, Waltham, MA), and
plunge-frozen in a liquid propane:ethane mixture (63:37, v/v) cooled
with liquid nitrogen.

Data collection was performed on a Titan Krios transmission electron
microscope (Thermo Fisher Scientific, Waltham, MA) equipped with
a K2 Summit direct electron detection camera (Gatan, Pleasanton,
CA) in counting mode and a post-column GIF Quantum energy filter
(Gatan) in zero-energy-loss mode with a slit width of 20 eV. Micrographs
were accrued at a calibrated pixel size of 1.06 Å and with a nominal
defocus range of −0.5 to −2 µm. Each micrograph consisted of 60 frames
collected over a 12-s exposure at a dose rate of roughly 8 electrons
per pixel per second for a total dose of roughly 85 electrons per Å².
A total of 3,435 micrographs were acquired as dose-fractionated image
stacks.
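The quoted acquisition figures can be cross-checked with a few lines of arithmetic. The sketch below only uses the numbers stated above; the variable and function names are illustrative, and the per-frame values are derived here rather than reported in the text.

```python
# Dose bookkeeping for one dose-fractionated exposure (inputs from the text).
PIXEL_SIZE_A = 1.06          # calibrated pixel size, angstroms per pixel
DOSE_RATE_E_PER_PX_S = 8.0   # roughly 8 electrons per pixel per second
EXPOSURE_S = 12.0            # exposure per micrograph, seconds
N_FRAMES = 60                # frames per micrograph


def total_dose_per_A2(dose_rate_px_s: float, exposure_s: float,
                      pixel_size_A: float) -> float:
    """Total dose in electrons per square angstrom."""
    pixel_area_A2 = pixel_size_A ** 2
    return dose_rate_px_s * exposure_s / pixel_area_A2


total_dose = total_dose_per_A2(DOSE_RATE_E_PER_PX_S, EXPOSURE_S, PIXEL_SIZE_A)
dose_per_frame = total_dose / N_FRAMES   # derived, not stated in the text
frame_time_s = EXPOSURE_S / N_FRAMES     # derived, not stated in the text

print(f"total dose     ~ {total_dose:.1f} e/A^2")
print(f"dose per frame ~ {dose_per_frame:.2f} e/A^2")
print(f"frame time     = {frame_time_s:.2f} s")
```

With the rounded inputs, the total comes out close to the "roughly 85 electrons per Å²" given in the text.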


Cryo-EM image processing 

Image processing began with frame alignment and dose-weighting of
the image stacks using the CPU-based implementation of MotionCor2
(ref. 34) in Relion 3.0 (ref. 35). The contrast transfer function
(CTF) of each non-dose-weighted micrograph was estimated with
Gctf (ref. 36) version 1.06. After visual inspection of the micrographs and their
power spectra, 3,334 were selected for further processing.

Approximately 3,000 particles were manually picked in Relion 3.0
(ref. 35), and extracted from a 4x binned data set with a pixel size of
4.24 Å. These data produced an initial set of two-dimensional (2D)
classes that were used as templates to select 1,048,241 particles
automatically, all of which were subsequently imported into cryoSPARC
version 2 (ref. 37) for extensive 2D classification. After elimination of
unfit classes, a total of 312,840 particles from the high-quality 2D classes
were combined to produce an ab initio three-dimensional (3D)
reference in cryoSPARC version 2 (ref. 37). On the basis of the ab initio model,
particles were re-extracted at full scale from the unbinned data set with
a pixel size of 1.06 Å in Relion 3.0 (ref. 35), and re-introduced into
cryoSPARC version 2 (ref. 37) for 3D refinement. Homogeneous refinement
of the ab initio 3D model against the unbinned set of particles yielded a
density map with nominal resolution of 3.6 Å according to the Fourier
shell correlation (FSC) = 0.143 gold standard criterion (ref. 38). Heterogeneous
refinement of multiple models obtained before and after homogeneous
refinement allowed removal of additional poor-quality particles and
reduced the particle count to 233,737. Nonuniform refinement then
improved the resolution to 3.5 Å. At this point, CTF refinement and
Bayesian polishing were conducted in Relion 3.0 (ref. 35), followed by
an additional round of nonuniform refinement in cryoSPARC version 2
(ref. 37), further improving the resolution to 3.3 Å.
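As a quick consistency check on the workflow above, the following sketch recomputes the binned pixel size, the corresponding Nyquist limits, and the fraction of picked particles surviving each selection step. Only the numbers come from the text; the function names and the stage labels are illustrative.

```python
# Bookkeeping for the binning and particle-selection steps described above.
UNBINNED_PIXEL_A = 1.06   # angstroms per pixel, from the text
BIN_FACTOR = 4            # 4x binning used for initial picking


def binned_pixel_size(pixel_size_A: float, bin_factor: int) -> float:
    """Pixel size after binning by an integer factor."""
    return pixel_size_A * bin_factor


def nyquist_limit_A(pixel_size_A: float) -> float:
    """Finest spacing (angstroms) representable at this sampling: two pixels."""
    return 2.0 * pixel_size_A


# Particle counts reported at successive stages (labels are shorthand).
STAGES = [
    ("template-based autopicking", 1_048_241),
    ("after 2D classification", 312_840),
    ("after heterogeneous refinement", 233_737),
]


def retained_fractions(stages):
    """Fraction of the initially picked particles remaining at each stage."""
    initial = stages[0][1]
    return [(name, count / initial) for name, count in stages]


print(f"binned pixel size: {binned_pixel_size(UNBINNED_PIXEL_A, BIN_FACTOR):.2f} A")
print(f"Nyquist (binned):   {nyquist_limit_A(binned_pixel_size(UNBINNED_PIXEL_A, BIN_FACTOR)):.2f} A")
print(f"Nyquist (unbinned): {nyquist_limit_A(UNBINNED_PIXEL_A):.2f} A")
for name, fraction in retained_fractions(STAGES):
    print(f"{name}: {fraction:.1%} of picked particles")
```

The reported 3.3 Å resolution is coarser than the 2.12 Å Nyquist limit of the unbinned data, so the sampling does not limit the final map.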

Although the transmembrane domains of the GABAB receptor
exhibit pseudo twofold symmetry, bulky carbohydrate densities that
are visible only in the ECD of the GABAB1b subunit resulted in sufficient
low-resolution asymmetry to prevent particle misalignment. These
densities include partial carbohydrate densities attached to Asn 323
and Asn 365 of GABAB1b that do not have counterparts in GABAB2.

The global density map exhibited directional anisotropy (ref. 39) that is
caused by interdomain movement about a flexible linker. To eliminate
the adverse effect of such movement on map quality, we performed
local refinement of the ECD and transmembrane domains of the GABAB
receptor independently. A mask was created covering each region,
and the nonuniform refinement algorithm was used as implemented
in cryoSPARC version 2 (ref. 37). The resulting reconstructions for the
individual ECD and transmembrane domains reached 3.1 Å and 3.4 Å
resolution, respectively. A composite map was generated in UCSF
Chimera (ref. 40) by taking the maximum values pointwise from the two locally
refined maps after alignment to the global reconstruction (‘vop
maximum’ command in UCSF Chimera (ref. 40)). This composite map was used for
subsequent model building and refinement.
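To make the "maximum values pointwise" operation concrete, here is a toy sketch of the combination rule on small nested lists. It is not the Chimera implementation: real maps are large 3D grids that must first be aligned and resampled onto a common grid, and the function name and example values below are made up for illustration.

```python
def pointwise_max(map_a, map_b):
    """Element-wise maximum of two equally shaped nested lists of numbers."""
    if isinstance(map_a, (int, float)) and isinstance(map_b, (int, float)):
        return max(map_a, map_b)
    if len(map_a) != len(map_b):
        raise ValueError("maps must have the same shape")
    return [pointwise_max(a, b) for a, b in zip(map_a, map_b)]


# Toy 2 x 2 "maps": one has strong density on the left, the other on the right.
ecd_focused = [[0.9, 0.1],
               [0.8, 0.0]]
tm_focused = [[0.2, 0.7],
              [0.1, 0.6]]

composite = pointwise_max(ecd_focused, tm_focused)
print(composite)
```

Each voxel of the composite takes its value from whichever locally refined map has the stronger density at that position.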


Resolutions of cryo-EM reconstructions were determined using a
cutoff value of 0.143 in gold standard half-map FSC curves (ref. 38).

We carried out 3D variability analysis (ref. 41) in cryoSPARC version 2 (ref. 37)
using the 233,737 particles from global nonuniform refinement as input.
Calculations were performed for the entire receptor, the ECDs and the
transmembrane domains, respectively. In each case, multiple modes of
variability were solved, and represented as eigenvectors along which
conformational changes occur. To visualize the transformation of
density, we calculated five reconstructions along each eigenvector, with a
filter resolution of 4.5 Å. A movie that combines these reconstructions
as frames was generated in Chimera (ref. 40) for each dimension of motion.


Model building and refinement 

Model building was carried out in COOT (ref. 42). The crystal structure of a
complex of human GABAB1b VFT and GABAB2 VFT in the apo form (Protein
Data Bank code 4MQE) was used as the initial model for the
extracellular domain of the receptor. The VFT modules of GABAB1b and GABAB2
were separately placed into density as rigid bodies. Individual residues
were then adjusted to optimize the fit. The linker and transmembrane
domain of each subunit were traced de novo based on the density.
The final model contained residues 48-368, 377-576 and 588-747 of
GABAB1b, and 54-294, 302-376, 385-584 and 595-749 of GABAB2.

In addition to the polypeptide chains, we built models for a Ca2+ ion in
the extracellular domain of GABAB1b, and for one phospholipid (PE 38:5
in GABAB1b and PC 38:2 in GABAB2) within the transmembrane domain of
each subunit. Density for carbohydrate was observed at three N-linked
glycosylation sites on GABAB1b (Asn 365, Asn 385 and Asn 397) and one
site on GABAB2 (Asn 404). An N-glucosamine residue was modelled at
each of these glycosylation sites. Continuous density was also
identified for ten cholesterol or CHS molecules surrounding the
transmembrane domains of both GABAB subunits. Cholesterols were modelled to
optimize the fit with density; however, these densities may belong to
CHS molecules with disordered parts. Density for an endogenous ligand
was found at the interdomain cleft of the GABAB1b VFT. Although this
density could be fit with GABA, it was not modelled because the origin
and identity of the ligand remained ambiguous without confirmation
by an independent method.

The entire structure was refined by the real-space refinement
algorithm and validated with the comprehensive validation application
as implemented in Phenix (ref. 43). Ramachandran statistics were calculated
using MolProbity (ref. 44). The refined model also has an overall EMRinger (ref. 45)
score of 2.7, while the extracellular and transmembrane domains have
scores of 3.4 and 1.8, respectively. The final model revealed that VFT
and transmembrane components are related to their counterparts in
the other subunit by 177° and 179° rotations about the vertical axis,
respectively.
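Rotation angles such as the 177° and 179° quoted above are conventionally read off the rotation matrix of a superposition through the identity trace(R) = 1 + 2 cos θ. The sketch below illustrates that relation only; it is not the authors' procedure, and the matrix is built synthetically here because no superposition matrices are given in the text.

```python
import math


def rotation_angle_deg(R):
    """Rotation angle (degrees) of a 3x3 rotation matrix, from its trace."""
    trace = R[0][0] + R[1][1] + R[2][2]
    # Clamp to [-1, 1] to guard against rounding just outside acos's domain.
    cos_theta = max(-1.0, min(1.0, (trace - 1.0) / 2.0))
    return math.degrees(math.acos(cos_theta))


def rotation_about_z(angle_deg):
    """Synthetic rotation about the z (vertical) axis, for illustration only."""
    t = math.radians(angle_deg)
    c, s = math.cos(t), math.sin(t)
    return [[c, -s, 0.0],
            [s, c, 0.0],
            [0.0, 0.0, 1.0]]


for angle in (177.0, 179.0, 180.0):
    recovered = rotation_angle_deg(rotation_about_z(angle))
    print(f"input {angle:.1f} deg -> recovered {recovered:.3f} deg")
```

An exact twofold axis would give 180°, so the deviations of 3° and 1° quantify how far each domain pair departs from perfect twofold symmetry.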

Pairwise structural alignment was performed using LSQMAN (ref. 46).
Figures were generated using Pymol Molecular Graphics System version
2.3 (Schrödinger), UCSF Chimera (ref. 40) and UCSF ChimeraX (ref. 47). Software
installation support was provided by SBGrid (ref. 48).


Scintillation proximity assay 

Binding of [3H]GABA (60 Ci mmol⁻¹; American Radiolabelled Chemicals,
St. Louis, MO) to the GABAB receptor was measured with a scintillation
proximity assay (SPA) (refs 49, 50). Purified GABAB1b(1-802)-GABAB2(1-819)
complex (100 ng) was extensively dialysed against the purification buffer
to remove any residual endogenous ligand. The dialysed protein was
then immobilized to yttrium silicate (YSi) protein A SPA beads (62.5 µg;
PerkinElmer, Waltham, MA) using anti-Flag M2 antibody (12.5 µg;
MilliporeSigma, St. Louis, MO), and incubated at 4 °C for 30 min in the
same buffer as used for the final step of protein purification (50 mM
HEPES, pH 7.5, 50 mM NaCl, 0.002% LMNG, 0.004% CHS).
Increasing concentrations (ranging from 0.2 µM to 25 µM) of [3H]GABA
(2.5 Ci mmol⁻¹ final specific radioactivity) were added to the protein/
antibody/SPA-bead mixture, and the samples were allowed to reach
equilibrium at 4 °C for 16 h. A reaction performed with an antibody/
SPA-bead mixture in the absence of GABAB receptor was used to
determine the nonproximity signal originating from nonspecific interaction
between the radioligand and SPA beads.

All samples were counted in a Microbeta counter (PerkinElmer,
Waltham, MA) in counts per minute (c.p.m.) in the SPA mode. The
efficiency of detection was calculated with a standard curve of known
[3H]GABA concentrations, and this was used to transform c.p.m. into pmol.
Specific binding was determined by subtracting the nonproximity
signal (nonspecific binding) from the total binding signal and was plotted
as a function of free radioligand concentration. Nonlinear regression
fitting of the data was performed in SigmaPlot 13.0 to obtain the
dissociation constant (Kd) and the molar ratio of GABA-to-receptor binding.
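The analysis above (background subtraction followed by nonlinear regression) can be sketched as follows. This is a minimal illustration and not the SigmaPlot procedure: it assumes a one-site saturation model, bound = Bmax · L / (Kd + L), uses a simple golden-section search over log10(Kd) with Bmax solved in closed form, and runs on synthetic, noise-free data. All names and data values are hypothetical.

```python
import math


def specific_binding(total, nonproximity):
    """Subtract the bead-only (nonproximity) signal from total binding."""
    return [t - n for t, n in zip(total, nonproximity)]


def one_site(free, kd, bmax):
    """One-site saturation isotherm: bound = Bmax * L / (Kd + L)."""
    return bmax * free / (kd + free)


def best_bmax(free, bound, kd):
    """For a fixed Kd the model is linear in Bmax, so solve it in closed form."""
    x = [L / (kd + L) for L in free]
    return sum(xi * bi for xi, bi in zip(x, bound)) / sum(xi * xi for xi in x)


def sse(free, bound, kd):
    """Sum of squared residuals at a given Kd (with the matching best Bmax)."""
    bmax = best_bmax(free, bound, kd)
    return sum((one_site(L, kd, bmax) - b) ** 2 for L, b in zip(free, bound))


def fit_one_site(free, bound, kd_lo=1e-3, kd_hi=1e3, iterations=100):
    """Golden-section search over log10(Kd); assumes a single minimum."""
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    a, b = math.log10(kd_lo), math.log10(kd_hi)
    for _ in range(iterations):
        c = b - inv_phi * (b - a)
        d = a + inv_phi * (b - a)
        if sse(free, bound, 10 ** c) < sse(free, bound, 10 ** d):
            b = d
        else:
            a = c
    kd = 10 ** ((a + b) / 2.0)
    return kd, best_bmax(free, bound, kd)


# Synthetic example: free ligand in micromolar, binding in arbitrary pmol units.
FREE_UM = [0.2, 0.5, 1.0, 2.5, 5.0, 10.0, 25.0]
TRUE_KD, TRUE_BMAX = 2.0, 1.5
nonproximity = [0.05 * L for L in FREE_UM]
total = [one_site(L, TRUE_KD, TRUE_BMAX) + n for L, n in zip(FREE_UM, nonproximity)]

specific = specific_binding(total, nonproximity)
kd_fit, bmax_fit = fit_one_site(FREE_UM, specific)
print(f"Kd ~ {kd_fit:.3f} uM, Bmax ~ {bmax_fit:.3f}")
```

Dividing the fitted Bmax (in pmol) by the pmol of receptor in the assay would give the molar ratio of GABA-to-receptor binding mentioned in the text; real data would also call for weighting and confidence intervals, which this sketch omits.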


Identification of phospholipid ligands and GABA 

Identification of bound endogenous lipids was conducted as described (ref. 51)
with modifications. Briefly, intact GABAB and control CaS receptor were
digested with trypsin overnight at 37 °C. Digested proteins were dried
and extracted with 1 ml of ice-cold methanol:water (9:1, v/v). The
supernatants were dried and resuspended with methanol:toluene (9:1, v/v)
to an equivalent concentration of 2 µM. For LC-MS/MS analysis, the
lipid extracts were separated on a Waters Acquity UPLC CSH C18 column
(100 x 2.1 mm; 1.7 µm) (Waters, Milford, MA) coupled to an Acquity
UPLC CSH C18 VanGuard precolumn (5 x 2.1 mm; 1.7 µm) (Waters). The
column was maintained at 65 °C at a flow rate of 0.6 ml min⁻¹. The mobile
phases consisted of A, acetonitrile:water (60:40, v/v) with ammonium 
formate (10 mM) and formic acid (0.1%), and B, 2-propanol:acetonitrile 
(90:10, v/v) with ammonium formate (10 mM) and formic acid (0.1%). 
The 15-min separation was conducted under the following gradient: 
0 min 15% B; 0-2 min 30% B; 2-2.5 min 48% B; 2.5-11 min 82% B; 11-12 min
99% B; 12-15 min 15% B. A Q Exactive HF mass spectrometer (Thermo 
Fisher Scientific, Waltham, MA) was operated in electrospray ioniza- 
tion (ESI) in positive mode with the following parameters: mass range 
100-1,500 m/z; spray voltage +3.6 kV; sheath gas (nitrogen) flow rate 60 
units; auxiliary gas (nitrogen) flow rate 25 units; capillary temperature 
320 °C; full scan MS1 mass resolving power 60,000; data-dependent 
MS/MS acquisition (dd-MS/MS) four scans per cycle; dd-MS/MS mass 
resolving power 15,000. The mass features that were differentially 
higher in the GABAB receptor were subjected to targeted MS/MS in
reinjections to acquire tandem mass spectra. Thermo Xcalibur ver- 
sion 4.0.27.19 was used for data acquisition. Data processing and 
identification were performed in MS-DIAL version 3.40. Identifica- 
tion was conducted by matching accurate mass, tandem mass spectra 
and chromatographic retention time with built-in lipid library
LipidBlast. The identified endogenous lipids bound to GABAB receptor
include phosphatidylcholine (PC) 38:2 (PC(18:1_20:1); International
Chemical Identifier (InChI) key: QLEJPADMSQQACL-WWUFLCHTSA-N),
and phosphatidylethanolamine (PE) 38:5 (PE(18:1_20:4); InChIKey 
VFUVYNGTMNUBMF-ZRVIQYDLSA-N). Although many isoforms exist 
for these lipids, both phospholipids share two long-chain fatty acyl 
moieties of 18 and 20 carbons based on mass spectrometry fragmenta- 
tion pattern and biological relevance. 
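When transcribing a gradient programme like the one above into instrument software, it helps to check that the time segments are contiguous and add up to the stated run time. The sketch below encodes the lipid gradient as listed and validates it. The tuple layout and function name are illustrative, and the code does not assume whether each percentage is a ramp end point or a hold value.

```python
# (start_min, end_min, percent_B) for the 15-min lipid separation, as listed.
LIPID_GRADIENT = [
    (0.0, 0.0, 15.0),
    (0.0, 2.0, 30.0),
    (2.0, 2.5, 48.0),
    (2.5, 11.0, 82.0),
    (11.0, 12.0, 99.0),
    (12.0, 15.0, 15.0),
]


def check_gradient(segments, expected_run_min):
    """Return a list of human-readable problems; an empty list means consistent."""
    problems = []
    for start, end, pct_b in segments:
        if end < start:
            problems.append(f"segment {start}-{end} min runs backwards")
        if not 0.0 <= pct_b <= 100.0:
            problems.append(f"segment {start}-{end} min has %B = {pct_b}")
    for (_, prev_end, _), (start, end, _) in zip(segments, segments[1:]):
        if start != prev_end:
            problems.append(
                f"segment {start}-{end} min does not start at {prev_end} min")
    if segments and segments[-1][1] != expected_run_min:
        problems.append(
            f"gradient ends at {segments[-1][1]} min, expected {expected_run_min}")
    return problems


issues = check_gradient(LIPID_GRADIENT, expected_run_min=15.0)
print("gradient table is consistent" if not issues else issues)
```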

Identification of GABA was conducted with targeted LC-MS/MS. 
Briefly, cell supernatant, cell lysate, culture media and lysis buffer 
controls were dried from 1 ml and extracted with 1 ml of ice-cold 
methanol:water (9:1, v/v). The supernatants were dried and
resuspended with 200 µl of acetonitrile:water (8:2, v/v). The extracts were
separated on a Waters Acquity UPLC BEH Amide column (150 x 2.1 mm;
1.7 µm) (Waters, Milford, MA) coupled with an Acquity UPLC BEH Amide
VanGuard precolumn (5 x 2.1 mm; 1.7 µm) (Waters). The column was
maintained at 45 °C at a flow rate of 0.4 ml min⁻¹. Mobile phase A was
water with ammonium formate (10 mM) and formic acid (0.125%) while 
B was acetonitrile:water (95:5, v/v) with ammonium formate (10 mM) 
and formic acid (0.125%). Separation was conducted using the
gradient: 0-2 min 100% B; 2-7.7 min 70% B; 7.7-9.5 min 40% B; 9.5-12.5 min
30% B; 10.25-12.75 100% B, 12.75-16.75 100% B. A Q Exactive HF mass
spectrometer was operated with the same parameters as above. A GABA
standard was injected along with the samples to confirm its spectrum 
and retention time. The responses of GABA in samples were normalized 
to that of GABA standard with known concentration. 


Cell surface expression 

Full-length human GABAB1b and GABAB2 were each cloned into a
pcDNA3.1(+) vector (Life Technologies, Carlsbad, CA), with a Flag tag
inserted after the signal peptide of GABAB1b, and a haemagglutinin (HA) tag
after the signal peptide of GABAB2. Similar constructs were also generated
for the C-terminally truncated GABAB1b(1-802) and GABAB2(1-819). Single
mutants of full-length GABAB1b (E673R, E309K and E423R) and GABAB2
(H579E and R714A) were constructed using the QuikChange mutagenesis
system (Agilent Technologies, Santa Clara, CA).

The cell surface expression levels of wild-type and mutant GABAB
receptor were measured as described. Briefly, HEK293 T/17 cells
(ATCC) were cultured in monolayer in DMEM/F12 media (Life
Technology, Carlsbad, CA) supplemented with 10% fetal bovine serum (FBS)
at 37 °C in the presence of 5% CO2. The cells were co-transfected with
GABAB1b and GABAB2 plasmids using Lipofectamine 3000 (Life
Technologies, Carlsbad, CA). Each GABAB1b and GABAB2 mutant was paired
with its wild-type partner. Given that GABAB1b is retained inside the cells
unless it is chaperoned by GABAB2, we used the surface expression level
of GABAB1b on intact cells to measure the amount of assembled
heterodimeric GABAB receptor on the cell surface. The amount of surface
GABAB1b protein detected for each pair of constructs was normalized
with the cell count in each experiment. The cell surface expression level
of each mutant is calculated as a percentage of the wild-type receptor.
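The normalization described above amounts to two ratios: signal per cell, then mutant relative to wild type. A minimal sketch, assuming the fluorescence readings are already background-corrected (the text does not say how background was handled); the function names and example numbers are hypothetical.

```python
def signal_per_cell(fluorescence: float, cell_count: float) -> float:
    """Surface signal normalized to the number of cells in the well."""
    if cell_count <= 0:
        raise ValueError("cell count must be positive")
    return fluorescence / cell_count


def percent_of_wild_type(mutant_fluorescence: float, mutant_cells: float,
                         wt_fluorescence: float, wt_cells: float) -> float:
    """Mutant surface expression as a percentage of the wild-type receptor."""
    mutant = signal_per_cell(mutant_fluorescence, mutant_cells)
    wild_type = signal_per_cell(wt_fluorescence, wt_cells)
    return 100.0 * mutant / wild_type


# Hypothetical readings: the mutant well has fewer cells and a lower signal.
print(f"{percent_of_wild_type(3000.0, 40000.0, 6000.0, 50000.0):.1f}% of wild type")
```

Normalizing by cell count first keeps a sparse well from being misread as a trafficking defect.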

After blocking with 1% BSA, the cells were incubated with mouse
anti-Flag M1 antibody (MilliporeSigma, St. Louis, MO) as the
primary antibody to measure GABAB1b expression, followed by donkey
anti-mouse IRDye 800-labelled antibody (Li-Cor Biosciences, Lincoln,
NE) as the secondary antibody. Fluorescent signals were measured with
an Odyssey Infrared Imager (Li-Cor Biosciences). Each experiment was
performed in triplicate.

HEK 293 T/17 cells were purchased from and authenticated by ATCC 
(catalogue number CRL-11268). Cell morphology was examined for each 
passage of cells. The cells were certified by ATCC to be free of myco- 
plasma contamination, but they were not tested again during culturing. 


Inositol phosphate measurement 

HEK293 T/17 cells were cotransfected with plasmids encoding GABAB1b,
GABAB2 and Gαqi5. The Gαqi5 chimaera was constructed by replacing the
five C-terminal amino acids of murine Gαq with those of murine Gαi
(ref. 8). Exchanging the C-terminal end of Gαq with that of Gαi permits
it to couple with the GABAB receptor and allows the functional activity
of the receptor to be tracked through the activity of phospholipase C
(PLC). Control experiments were conducted using cells transfected
with an empty pcDNA3.1(+) vector, Gαqi5 alone, or wild-type GABAB1b
and GABAB2 in the absence of Gαqi5.

Inositol phosphate accumulation was quantified with the
homogeneous time-resolved fluorescence (HTRF) IP-one Tb kit
(Cisbio Bioassays, Codolet, France), which measures the
accumulation of inositol-1-monophosphate (IP1), a metabolite of
inositol-1,4,5-triphosphate (IP3). One day after transfection, the cells
were stimulated with increasing concentrations of baclofen for one
hour at 37 °C. The stimulated cells were lysed, and the native IP1 that had
been produced was incubated with a d2 fluorophore-labelled IP1
analogue (the acceptor) to compete for binding to an Eu Cryptate-coupled
anti-IP1 monoclonal antibody (the donor). Fluorescence data were
collected at 620 nm and 665 nm with a PHERAstar FS plate reader
(BMG Labtech, Cary, NC) after laser excitation at 320 nm. The
fluorescence resonance energy transfer (FRET) signal was calculated as the
fluorescence ratio (665 nm/620 nm) and is inversely proportional to
the concentration of native IP1 produced following GABAB activation
through a chimaeric Gαqi5 G protein. The agonist-induced receptor
response of each mutant was calculated as a percentage of the
maximum activity of wild-type receptor relative to the activity observed for
Gαqi5 alone. Basal activity was determined in the absence of baclofen
stimulation and calculated similarly to the agonist-dependent receptor
response. Data analysis was performed using the nonlinear regression
algorithms in Prism (GraphPad Software, San Diego, CA). Data points
represent averages ± s.e.m. of multiple experiments, each consisting
of quadruplicate measurements.
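The read-out and normalization described above can be sketched as below. This is one plausible reading of the procedure and not the authors' exact calculation: it works directly on the raw 665/620 ratio, whereas an analysis may first convert ratios to IP1 concentrations with a standard curve. Function names and the example readings are hypothetical.

```python
def htrf_ratio(f665: float, f620: float) -> float:
    """FRET signal as the acceptor/donor fluorescence ratio (665 nm / 620 nm)."""
    return f665 / f620


def percent_of_wt_max(ratio_sample: float, ratio_gqi5_alone: float,
                      ratio_wt_max: float) -> float:
    """Response as % of the wild-type maximum, relative to the G-protein-only control.

    The ratio falls as IP1 rises (competition assay), so activity is measured
    as the drop in ratio below the control transfected with the chimaera alone.
    """
    window = ratio_gqi5_alone - ratio_wt_max
    if window == 0:
        raise ValueError("wild-type maximum equals the control; no assay window")
    return 100.0 * (ratio_gqi5_alone - ratio_sample) / window


# Hypothetical plate-reader values (arbitrary fluorescence units).
control = htrf_ratio(9000.0, 10000.0)   # chimaera alone, no receptor
wt_max = htrf_ratio(3000.0, 10000.0)    # wild type at saturating baclofen
mutant = htrf_ratio(4500.0, 10000.0)    # mutant at saturating baclofen

print(f"mutant response: {percent_of_wt_max(mutant, control, wt_max):.0f}% of wild-type maximum")
```

Basal activity would be computed with the same function from readings taken without baclofen.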

Application of a known antagonist, CGP54626, reduced agonist
potency as expected. In addition, the compound lowered the basal
activity of the GABAB receptor, indicating that it has inverse agonist
activity, as previously reported. Therefore, we refer to the compound
as an inverse agonist.


Inductively coupled plasma mass spectrometry 

Purified GABAB receptor (200 µl, 11.1 µg µl⁻¹ or 57.7 µM) was collected in
metal-free tubes and digested overnight by adding 1 ml of concentrated
nitric acid (HNO3) (Fisher Scientific, Hampton, NH; Optima grade). The
digested protein was then diluted to a total volume of 10 ml with 8.7
ml of deionized water supplemented with 500 µg l⁻¹ of gold (Au) and
100 µl of an internal standard solution containing 500 µg l⁻¹ each of
gallium (Ga) and rhodium (Rh) in 2% HNO3. The protein purification
buffer (200 µl), containing 10 mM HEPES, pH 7.5, 50 mM NaCl, 0.002%
LMNG and 0.0004% CHS, was similarly mock digested and diluted for
analysis.
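The sample-preparation volumes and concentrations above are internally consistent, as the following arithmetic sketch shows. Inputs are taken from the text; the derived quantities (dilution factor, final internal-standard concentration, implied molar mass, diluted protein concentration) are computed here for illustration and are not values reported by the authors.

```python
# Volumes combined for one digested sample (ml), from the text.
SAMPLE_ML = 0.200          # purified receptor
NITRIC_ACID_ML = 1.0       # concentrated HNO3
WATER_ML = 8.7             # deionized water with Au
INTERNAL_STD_ML = 0.100    # Ga/Rh internal standard solution

INTERNAL_STD_STOCK_UG_PER_L = 500.0   # each of Ga and Rh
PROTEIN_G_PER_L = 11.1                # 11.1 ug/ul is the same as 11.1 g/l
PROTEIN_UM = 57.7                     # stated molar concentration


def total_volume_ml() -> float:
    return SAMPLE_ML + NITRIC_ACID_ML + WATER_ML + INTERNAL_STD_ML


def dilution_factor() -> float:
    """Fold dilution of the protein sample in the final digest."""
    return total_volume_ml() / SAMPLE_ML


def internal_std_final_ug_per_l() -> float:
    """Final Ga (or Rh) concentration after dilution into the digest."""
    return INTERNAL_STD_STOCK_UG_PER_L * INTERNAL_STD_ML / total_volume_ml()


def implied_molar_mass_g_per_mol() -> float:
    """Molar mass implied by the paired mass and molar concentrations."""
    return PROTEIN_G_PER_L / (PROTEIN_UM * 1e-6)


print(f"total volume:       {total_volume_ml():.1f} ml")
print(f"dilution factor:    {dilution_factor():.0f}x")
print(f"internal standard:  {internal_std_final_ug_per_l():.1f} ug/l each of Ga and Rh")
print(f"implied molar mass: {implied_molar_mass_g_per_mol():,.0f} g/mol")
print(f"protein in digest:  {PROTEIN_UM / dilution_factor():.3f} uM")
```

The implied molar mass of about 192 kDa is in line with the mass of the heterodimeric complex, which suggests the molar concentration refers to the heterodimer. Measured metal concentrations in the digest would be multiplied by the dilution factor to refer back to the undiluted protein.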

Inductively coupled plasma mass spectrometry (ICP-MS) was
conducted using a NexION 350S ICP-MS instrument (PerkinElmer,
Waltham, MA) equipped with dynamic reaction cell (DRC) feature and
a SC-4 DX FAST Autosampler (Elemental Scientific, Omaha, NE). The
DRC-ICP-MS experimental method was developed from published
procedures and a laboratory protocol for multi-element DRC-ICP-MS
from the Centers for Disease Control (CDC)
(https://www.cdc.gov/nchs/data/nhanes/nhanes_13_14/UM_UMS_UTAS_UTASS_H_MET.pdf). The
concentrations of magnesium (Mg), calcium (Ca), manganese (Mn),
iron (Fe), cobalt (Co), nickel (Ni), copper (Cu), zinc (Zn) and strontium
(Sr) in the digested protein and buffer samples were measured. Data
points represent averages ± coefficient of variance from eight
measurements within two experiments.

One multi-element calibration standard was prepared from
concentrated single-element stocks, and used for instrument calibration.
The calibration standard was diluted to various concentrations using a
solution containing 10% HNO3 and 500 µg l⁻¹ of Au to cover the expected
concentration range of each analyte in the protein sample: 0.01, 0.02,
0.05, 0.1, 0.2 µg l⁻¹ for Co and Sr; 0.05, 0.1, 0.25, 0.5, 1.0 µg l⁻¹ for Mn and
Ni; 0.25, 0.5, 1.25, 2.5, 5.0 µg l⁻¹ for Mg and Cu; 0.5, 1.0, 2.5, 5.0, 10.0 µg l⁻¹
for Fe and Zn; and 2.5, 5.0, 12.5, 25, 50 µg l⁻¹ for Ca.

The instrument was also calibrated against a set of blank solutions, 
including commercially available quality controls containing digested 
hair samples from Public Health Expertise and Reference Center, 
Quebec (INSPQ, Quebec, Canada), and a water sample containing a 
broad range of metals from the National Institute of Standards and Technology (NIST,
Gaithersburg, MD). 

Special attention was paid to correction for matrix-induced interfer- 
ences. Matrix suppression was compensated by the selection of suitable 
internal standards, which were matched to masses and, if possible, to 
ionization properties of the analytes. The internal standards were added 
to each calibration standard and quality control sample to the same 
final concentration as that in the protein sample and buffer (5 µg l⁻¹ each
of Ga and Rh). The elements Mg, Ca, Mn, Fe, Co, Ni, Cu, and Zn were 
corrected with Ga, while Sr was corrected with Rh. Polyatomic interfer- 
ences were suppressed with the instrument’s DRC technology feature, 
using ammonia as a second gas for Mn and Fe, while Mg, Ca, Sr, Co, Ni, 
Cu, and Zn were measured in standard mode without a second gas. 
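Internal-standard correction of this kind is commonly implemented by calibrating on the analyte-to-internal-standard signal ratio. The sketch below illustrates that convention with an ordinary least-squares line; the text does not state the regression model the instrument software used, so the linear fit, the function names, and the synthetic count data are all assumptions. Only the analyte-to-standard pairing and the Ca standard levels come from the text.

```python
# Internal standard assigned to each analyte, as described in the text.
INTERNAL_STANDARD = {
    "Mg": "Ga", "Ca": "Ga", "Mn": "Ga", "Fe": "Ga",
    "Co": "Ga", "Ni": "Ga", "Cu": "Ga", "Zn": "Ga",
    "Sr": "Rh",
}

CA_STANDARDS_UG_PER_L = [2.5, 5.0, 12.5, 25.0, 50.0]  # from the text


def fit_line(x, y):
    """Ordinary least-squares slope and intercept for y = slope * x + intercept."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def quantify(analyte_counts, internal_std_counts, slope, intercept,
             dilution_factor=1.0):
    """Concentration from a signal ratio, scaled back to the undiluted sample."""
    ratio = analyte_counts / internal_std_counts
    return (ratio - intercept) / slope * dilution_factor


# Synthetic calibration: the ratio rises linearly with Ca concentration.
ratios = [0.02 * c + 0.001 for c in CA_STANDARDS_UG_PER_L]
slope, intercept = fit_line(CA_STANDARDS_UG_PER_L, ratios)

# Synthetic unknown: 20,100 Ca counts against 100,000 Ga counts.
ca_in_digest = quantify(20100.0, 100000.0, slope, intercept)
print(f"Ca in digest ~ {ca_in_digest:.2f} ug/l (corrected with {INTERNAL_STANDARD['Ca']})")
```

Because matrix suppression affects the analyte and its internal standard similarly, taking the ratio cancels much of the effect before the calibration line is applied.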


Matrix-assisted laser desorption/ionization 

Purified GABAB1b(1-802)-GABAB2(1-819) complex (0.3 mg ml⁻¹, 1 µl)
was mixed with 1 µl sinapinic acid matrix solution (Bruker Daltonics,
Billerica, MA) containing 10 mg of sinapinic acid in 1 ml of 2.5%
trifluoroacetic acid (MilliporeSigma, St. Louis, MO) and 50% acetonitrile
(MilliporeSigma). The protein-matrix suspension (2 µl) was added to
the ground-steel matrix-assisted laser desorption/ionization (MALDI)
target plate and dried at room temperature. Mass spectra were collected
using an Ultraflextreme MALDI time-of-flight (TOF)/TOF mass
spectrometer (Bruker Daltonics) operated with FlexControl software in linear
positive mode, that is, using a mass range of 30,000-120,000 daltons.
The instrument was externally calibrated with a Proteins MALDI-MS
calibration kit (MilliporeSigma, St. Louis, MO). Each individual mass
spectrum was analysed and adjusted for smoothness and baseline
using FlexAnalysis software 3.0 (Bruker Daltonics). We determined the
molecular mass of the heterodimeric GABAB1b(1-802)-GABAB2(1-819)
complex to be 192,647.967 daltons.


Reporting summary 


Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


All data are included in this paper and its Supplementary Information.
Cryo-EM density maps of the GABAB receptor have been deposited in the
Electron Microscopy Data Bank (https://www.ebi.ac.uk/pdbe/emdb/)
under accession code EMD-21685. Atomic coordinates for the GABAB
receptor structure have been deposited in the RCSB Protein Data Bank
under accession code 6WIV. Raw cryo-EM images have been deposited
in the Electron Microscopy Public Image Archive
(https://www.ebi.ac.uk/pdbe/emdb/empiar/) under accession code EMPIAR-10410.


30. Goehring, A. et al. Screening and large-scale expression of membrane proteins in 
mammalian cells for structural studies. Nat. Protocols 9, 2574-2585 (2014). 

31. Margeta-Mitrovic, M., Jan, Y. N. & Jan, L. Y. A trafficking checkpoint controls GABAB
receptor heterodimerization. Neuron 27, 97-106 (2000).

32. Pagano, A. et al. C-terminal interaction is essential for surface trafficking but not for
heteromeric assembly of GABAB receptors. J. Neurosci. 21, 1189-1202 (2001).

33. Reeves, P. J., Callewaert, N., Contreras, R. & Khorana, H. G. Structure and function in
rhodopsin: high-level expression of rhodopsin with restricted and homogeneous
N-glycosylation by a tetracycline-inducible N-acetylglucosaminyltransferase I-negative
HEK293S stable mammalian cell line. Proc. Natl Acad. Sci. USA 99, 13419-13424 (2002).

34. Zheng, S. Q. et al. MotionCor2: anisotropic correction of beam-induced motion for 
improved cryo-electron microscopy. Nat. Methods 14, 331-332 (2017). 

35. Scheres, S. H. RELION: implementation of a Bayesian approach to cryo-EM structure 
determination. J. Struct. Biol. 180, 519-530 (2012). 

36. Zhang, K. Gctf: real-time CTF determination and correction. J. Struct. Biol. 193, 1-12 (2016). 

37. Punjani, A., Rubinstein, J. L., Fleet, D. J. & Brubaker, M. A. cryoSPARC: algorithms for rapid 
unsupervised cryo-EM structure determination. Nat. Methods 14, 290-296 (2017). 

38. Scheres, S. H. & Chen, S. Prevention of overfitting in cryo-EM structure determination. 
Nat. Methods 9, 853-854 (2012). 

39. Tan, Y. Z. et al. Addressing preferred specimen orientation in single-particle cryo-EM 
through tilting. Nat. Methods 14, 793-796 (2017). 

40. Pettersen, E. F. et al. UCSF Chimera—a visualization system for exploratory research and 
analysis. J. Comput. Chem. 25, 1605-1612 (2004). 

41. Punjani, A. & Fleet, D. J. 3D variability analysis: directly resolving continuous flexibility and 
discrete heterogeneity from single particle cryo-EM images. bioRxiv https://doi. 
org/10.1101/2020.04.08.032466 (2020). 

42. Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. Features and development of Coot. 
Acta Crystallogr. D 66, 486-501 (2010). 


43. Adams, P. D. et al. PHENIX: a comprehensive Python-based system for macromolecular 
structure solution. Acta Crystallogr. D 66, 213-221 (2010). 

44. Chen, V. B. et al. MolProbity: all-atom structure validation for macromolecular
crystallography. Acta Crystallogr. D 66, 12-21 (2010).

45. Barad, B. A. et al. EMRinger: side chain-directed model and map validation for 3D 
cryo-electron microscopy. Nat. Methods 12, 943-946 (2015). 

46. Novotny, M., Madsen, D. & Kleywegt, G. J. Evaluation of protein fold comparison servers. 
Proteins 54, 260-270 (2004). 

47. Goddard, T. D. et al. UCSF ChimeraX: meeting modern challenges in visualization and 
analysis. Protein Sci. 27, 14-25 (2018). 

48. Morin, A. et al. Collaboration gets the most out of software. eLife 2, e01456 (2013). 

49. Geng, Y. et al. Structure and functional interaction of the extracellular domain of human
GABAB receptor GBR2. Nat. Neurosci. 15, 970-978 (2012).

50. Quick, M. & Javitch, J. A. Monitoring the function of membrane transport proteins in 
detergent-solubilized form. Proc. Natl Acad. Sci. USA 104, 3603-3608 (2007). 

51. Gupta, K. et al. Identifying key membrane protein lipid interactions using mass 
spectrometry. Nat. Protocols 13, 1106-1120 (2018). 

52. Mafu, S. et al. Biosynthesis of the microtubule-destabilizing diterpene pseudolaric acid B 
from golden larch involves an unusual diterpene synthase. Proc. Natl Acad. Sci. USA 114, 
974-979 (2017). 

53. Kind, T. et al. LipidBlast in silico tandem mass spectrometry database for lipid 
identification. Nat. Methods 10, 755-758 (2013). 

54. Mukherjee, R. S., McBride, E. W., Beinborn, M., Dunlap, K. & Kopin, A. S. Point mutations in
either subunit of the GABAB receptor confer constitutive activity to the heterodimer. Mol.
Pharmacol. 70, 1406-1413 (2006).

55. Chen, K. L. B., Amarasiriwardena, C. J. & Christiani, D. C. Determination of total arsenic
concentrations in nails by inductively coupled plasma mass spectrometry. Biol. Trace
Elem. Res. 67, 109-125 (1999).

56. Pruszkowski, E., Neubauer, K. & Thomas, R. An overview of clinical applications by 
inductively coupled plasma mass spectrometry. Atomic Spectroscopy 19, 111-115 (1998). 

57. Hollenstein, K. et al. Structure of class B GPCR corticotropin-releasing factor receptor 1. 
Nature 499, 438-443 (2013). 

58. Wang, C. et al. Structure of the human smoothened receptor bound to an antitumour 
agent. Nature 497, 338-343 (2013). 


Acknowledgements We thank R. Henderson for early-stage cryo-EM investigation and critical
reading of the manuscript; C. S. Zuker for advice and financial support; C. Karan and R. Realubit
for assistance with the EnVision plate reader at Columbia Genome Center; A. Sobolevsky,
K. Saotome and E. Cao for BacMam vectors; Y. H. Wong for a Gαqi5 chimera plasmid; and
B. K. Kobilka and B. Skiniotis for advice. Titan Krios data collection was performed at the
Simons Electron Microscopy Center, directed by B. O. Carragher and C. Potter, supported by
grants from the Simons Foundation (SF349247), NYSTAR and the National Institutes of Health
(NIH; GM103310). This work was supported by NIH grants R01GM088454 (to Q.R.F.),
R01GM125801 (to Q.R.F., P.A.S. and M.Q.), R01GM107462 (to W.A.H.), P41GM116799 (to W.A.H.),
and U2C ES030158 for lipid identification (to O.F.). Q.R.F. was an Irma Hirschl Career Scientist,
Pew Scholar, McKnight Scholar and Schaefer Scholar.


Author contributions J.P., J.L., Q.R.F. and K.M.R. cultured cells and purified protein; Z.F. and
A.F. prepared cryo-EM grids; Z.F., A.F., Q.R.F. and O.B.C. collected cryo-EM data; Z.F. and
A.F. performed initial image processing; O.B.C. processed cryo-EM data to high resolution;
L.M. and Q.R.F. built and refined models; Q.R.F., L.M., A.F., J.P. and Z.F. analysed structures;
T.S. and O.F. identified phospholipids and GABA by mass spectrometry; V.N.S. and J.G.
conducted elemental analysis; Q.R.F., J.L., J.P., A.F., J.T., X.L. and J.P.W. performed mutagenesis
and cell-based functional assays; M.Q. performed and analysed radioligand binding studies;
B.C., Y.G., H.Z. and Y.K. generated expression plasmids and carried out early protein purification
trials; R.G., W.J.R., E.E., R.K.H. and Z.Y. assisted with cryo-EM data collection; S.C., Z.L., W.J.R.
and E.T.E. performed initial cryo-EM characterization; R.K.S. measured molecular mass;
B.K. screened detergents; W.A.H. contributed to structural analysis; Q.R.F., P.A.S. and J.A.J.
supervised functional analyses; J.F., O.B.C. and H.Y. supervised cryo-EM studies; Q.R.F. and
A.F. wrote the paper; T.S., V.N.S., M.Q., Z.F. and R.K.S. contributed Methods sections; all authors
contributed to revision of the manuscript; Q.R.F., A.F., J.P., O.B.C., T.S., Z.F., J.L. and M.Q.
prepared figures; Q.R.F. conceived and supervised the project.


Competing interests The authors declare no competing interests. 


Additional information 

Supplementary information is available for this paper at
https://doi.org/10.1038/s41586-020-2452-0.

Correspondence and requests for materials should be addressed to O.B.C., J.F. or Q.R.F.

Peer review information Nature thanks Ryan Hibbs, Bernhard Bettler and the other,
anonymous, reviewer(s) for their contribution to the peer review of this work.

Reprints and permissions information is available at http://www.nature.com/reprints. 


Article 


[Extended Data Fig. 1 graphics, panels a-d. Recoverable labels: gel lanes with molecular-weight
markers (kDa) and bands marked heterodimer, GABAB1b and GABAB2; binding curve plotted against
free [GABA] (µM); dose-response traces for WT/WT/Gqi (n = 4) and 802/819/Gqi (n = 4).]


Extended Data Fig. 1 | Purification and functional analysis of the human
GABAB receptor. a, Superose 6 size-exclusion chromatography profile of
detergent-purified GABAB1b(1-802)-GABAB2(1-819) complex. b, SDS-PAGE gel of
the size-exclusion peak fraction from a under reducing conditions. For gel
source data, see Supplementary Fig. 2. c, Dose-dependent [3H]GABA binding to
purified GABAB1b(1-802)-GABAB2(1-819) complex, reaching maximum molar
ratios of GABA-to-receptor binding at 0.98 ± 0.03 mol mol−1. Each data point
represents the mean of triplicate measurements from a typical experiment.
The experiment was repeated four times with similar results. Data were
subjected to nonlinear regression fitting, and the dissociation constant
(Kd = 1.3 ± 0.16 µM) is reported as mean ± s.e.m. of the fit. d, Functional analysis
comparison of full-length and truncated WT GABAB receptor. Shown is the
dose-dependent baclofen-stimulated receptor response in cells transiently
expressing Gαqi5 (abbreviated as Gqi) with full-length GABAB heterodimer or
the C-terminally truncated GABAB1b(1-802)-GABAB2(1-819) complex. Cells
transfected with Gαqi5 alone were used as a negative control. Relative
agonist-stimulated activity was measured by IP1 accumulation, and is
expressed as a percentage of maximum wild-type activity induced by baclofen
relative to the activity of Gαqi5 alone. Data points represent averages ± s.e.m. of
multiple experiments (n), each with quadruplicate measurements. Cell surface
expression level was 106% for the GABAB1b(1-802)-GABAB2(1-819) complex in
comparison with the full-length WT/WT heterodimer.
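
The saturation-binding fit reported for panel c can be illustrated with a minimal sketch. The caption states only that nonlinear regression was used, so the single-site hyperbolic model, B = Bmax·[L]/(Kd + [L]), is an assumption here, as are the function name and the concentration grid. The two parameter values are the ones quoted in the caption.

```python
# Hypothetical illustration: one-site saturation binding (assumed model, not
# confirmed by the caption). Units: concentrations in µM, binding in mol mol^-1.

def one_site_binding(conc_um, bmax, kd_um):
    """Return predicted binding for a single-site hyperbolic model."""
    if conc_um < 0:
        raise ValueError("concentration must be non-negative")
    if kd_um <= 0:
        raise ValueError("Kd must be positive")
    return bmax * conc_um / (kd_um + conc_um)


BMAX = 0.98   # maximum molar ratio quoted in the caption (mol mol^-1)
KD_UM = 1.3   # dissociation constant quoted in the caption (µM)

# Illustrative concentration grid spanning the 0-25 µM axis range (assumed values).
curve = [(c, one_site_binding(c, BMAX, KD_UM)) for c in (0.0, 1.3, 5.0, 10.0, 25.0)]
for conc, bound in curve:
    print(f"[GABA] = {conc:5.1f} µM -> bound = {bound:.3f} mol/mol")
```

Under this model, binding reaches half of Bmax when the free ligand concentration equals Kd, which is why the curve is already close to saturation at the upper end of the plotted range.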
Extended Data Fig. 2 | Cryo-EM imaging of human GABAB receptor.

a, Workflow of cryo-EM data processing. b, A representative motion-corrected
cryo-electron micrograph of the GABAB receptor. c, Reference-free 2D class
averages, highlighting clear density for transmembrane helices. d, Global
density map, coloured according to local resolution, in full and clipped views
perpendicular to the plane of the membrane. e, Global FSC curve (purple)
corrected by high-resolution noise substitution. The overall resolution as
determined by an FSC cut-off value of 0.143 (blue line) is 3.3 Å. f, 3D FSC curves,
measuring directional resolution anisotropy. Plots show the global half-map
FSC (red solid line; left-hand y-axis), together with the spread of directional
resolution values within ±1 standard deviation of the mean (area encompassed
by green dashed lines), and a histogram (turquoise; right-hand y-axis) of 100
such directional resolution values sampled evenly over the 3D FSC threshold
value of 0.143 (blue line). The sphericity value reported by 3D FSC is 0.958 out
of 1. g, h, Separate FSC curves for the locally refined reconstructions of ECDs (g)
and transmembrane domains (h). The blue lines mark the resolution
corresponding to an FSC value of 0.143 (ECD, 3.1 Å; transmembrane region,
3.4 Å). The adjacent diagrams show the masks (translucent surface) used for
each local refinement.
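
How a resolution value is read off an FSC curve at the 0.143 cut-off can be sketched as follows. This is a generic illustration, not the procedure implemented in the software cited in this paper: the function name, the linear interpolation between shells and the example curve are all assumptions.

```python
# Hypothetical illustration: locate the first crossing of an FSC curve below a
# threshold and convert the crossing frequency (1/Å) into a resolution (Å).

def resolution_at_threshold(freqs, fsc, threshold=0.143):
    """Return the resolution in Å at the first downward crossing, or None."""
    if len(freqs) != len(fsc):
        raise ValueError("freqs and fsc must have the same length")
    for i in range(1, len(fsc)):
        if fsc[i - 1] >= threshold > fsc[i]:
            # Linear interpolation between the two shells that bracket the crossing.
            frac = (fsc[i - 1] - threshold) / (fsc[i - 1] - fsc[i])
            f_cross = freqs[i - 1] + frac * (freqs[i] - freqs[i - 1])
            return 1.0 / f_cross
    return None  # the curve never drops below the threshold


# Synthetic curve for demonstration only (not data from this study).
example_freqs = [0.1, 0.2, 0.3, 0.4]        # spatial frequency, 1/Å
example_fsc = [1.0, 0.9, 0.243, 0.043]      # correlation per shell
print(resolution_at_threshold(example_freqs, example_fsc))
```

Here the synthetic curve crosses 0.143 midway between the last two shells, so the returned value is the reciprocal of 0.35 Å⁻¹.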
Extended Data Fig. 3 | Structural model of the GABAB receptor fit within the
cryo-EM map. a–e, Cryo-EM density maps and refined models for the LB1
interface helices (H-B and H-C) in the extracellular VFT (a), the linker between
the VFT and transmembrane domains (b), all seven transmembrane helices of
the GABAB1b subunit (c), the ten modelled transmembrane cholesterols (d) and
all seven transmembrane helices of the GABAB2 subunit (e). The density map is a
composite of the locally refined reconstructions for the ECD and
transmembrane domains. The N- and C-terminal residues of each helix are
labelled.
Extended Data Fig. 4 | Architecture of the GABAB receptor. a, Structure of
the GABAB receptor in four views related by 90° rotations about an axis
perpendicular to the membrane. GABAB1b (blue) and GABAB2 (green) are
rendered as ribbons; Ca2+ is shown as a green sphere; phospholipids (PE 38:5
and PC 38:2) are presented as yellow space-filling models; the observed
N-linked glycans (NAGs) and cholesterols (CLRs) are shown respectively as grey
and pink ball-and-stick models. Transmembrane helices 1-7, along with N and
C termini, are marked for each subunit. b, Cryo-EM density map of the GABAB
receptor, in the same orientations and colour scheme as in a. The map is
composed of local reconstructions for the ECD and transmembrane domain,
which were independently refined to 3.1 Å and 3.4 Å, respectively. c, Linker and
transmembrane domain of the GABAB receptor viewed from the extracellular
and intracellular sides.
Extended Data Fig. 5 | Heterodimer conformation and interface features of
the GABAB receptor. a, b, Cryo-EM structure of near full-length GABAB
receptor (cyan) superimposed with the crystal structure of its extracellular
VFT module in the inactive state (PDB code 4MQE; purple) (a) or active state
(PDB code 4MS3; red) (b). The middle panel shows the heterodimeric receptor
structures superimposed based on the LB1 domain of the GABAB1b subunit. The
two side panels show superpositions of individual GABAB1b and GABAB2
subunits on the basis of their respective LB1 domains. In b, the green line
denotes the axis of rotation that relates the LB2 domains of near full-length and
VFT structures of GABAB1b (rotation χ = 28°; screw translation τχ = 0.6 Å), or near
full-length and VFT structures of GABAB2 (rotation χ = 7°; screw translation
τχ = 0.3 Å). c, d, Extracellular LB2 domains viewed from the C-terminal end.
Superposition of near full-length (cyan) and VFT structures (c, inactive state,
purple; d, active state, red) based on the LB1 domain of the GABAB1b subunit.
Within each heterodimeric complex, the C termini of the LB2 domains in the
GABAB1b and GABAB2 subunits are shown as spheres, and the distance between
the two C termini is marked by dotted lines. e, Cryo-EM structure of full-length
mGlu5 in the inactive (PDB code 6N52) and active (PDB code 6N51)
conformations”°. f, Molecular surface of the GABAB1b-GABAB2 complex,
showing the plane of the heterodimer interface for the extracellular and
transmembrane domains. Structural elements involved in heterodimer
formation are highlighted in cartoon format (ectodomain, H-B and H-C
helices; transmembrane domain, TM5 and TM3 helices). g, The GABAB
transmembrane domain viewed from the extracellular side, comparing the
locations of core (I, II, IIIa, IIIb) versus peripheral cholesterol-mediated
heterodimer contacts from different layers. Heterodimer contacts mediated
by two cholesterols (CLR6 and CLR3) are shown at the bottom.
Extended Data Fig. 6 | Extracellular ligand binding in GABAB1b.
a, b, Molecular surface (a) and ribbon representation (b) of the GABAB1b
subunit, showing the location of the Ca2+-binding site and an unmodelled
density at the interdomain crevice of VFT. c, d, Functional analysis of the
impact of endogenous Ca2+. Basal activity (c) and dose-dependent
baclofen-stimulated receptor response (d) in cells transiently expressing the
Gαqi5 subunit (abbreviated as Gqi) with different combinations of WT and
mutant GABAB-receptor subunits (GABAB1b E309K, abbreviated as E309K;
GABAB1b E423R, abbreviated as E423R). IP1 accumulation by the WT/WT
heterodimer was measured in the presence and absence of 2.5 mM EGTA. Cells
transfected with Gαqi5 alone were used as negative controls. Relative activity in
both graphs is expressed as a percentage of the maximum wild-type activity
induced by baclofen relative to the activity of Gαqi5 alone. Data points represent
averages ± s.e.m. of multiple experiments (n), each with quadruplicate
measurements. **P = 0.0016, ***P = 0.0002, ****P < 0.0001; one-way ANOVA
with Bonferroni’s post hoc test was used to calculate statistical differences in
basal activity (c). Cell surface expression level was 107% for E309K/WT and 87%
for E423R/WT mutants in comparison with the WT/WT heterodimer. e, Fitting
of GABA into the extra density (contoured at 7.0σ) at the orthosteric
ligand-binding site and its potential interaction with GABAB1b. f, Concentration
of endogenous GABA in the supernatant and lysate of HEK 293 GnTI− cells after
recombinant expression of the GABAB receptor, as well as cell culture medium
and lysis buffer controls, as detected by mass spectrometry.
Extended Data Fig. 7 | Endogenous phospholipid-binding sites of the
GABAB receptor. a, b, Ribbon representation of the GABAB1b (a) and GABAB2 (b)
transmembrane domains, highlighting the cryo-EM density for phospholipids
contoured at 4.0σ. Phospholipids are rendered in ball-and-stick
representation. c, d, Electrostatic potential surface of the lipid-binding pocket
in GABAB1b (c) and GABAB2 (d). The phospholipids are shown as spheres.
Charged residues that directly contact the phosphate group of each lipid are
marked. e-h, Comparison of phospholipids bound to GABAB subunits with
ligands bound to the class A GPCRs rhodopsin’ (PDB code 1F88) and S1P1
receptor” (PDB code 3V2Y) (e), the class B GPCR corticotropin-releasing factor
receptor 1 (CRF1; ref. 57; PDB code 4K5Y) (f), the class C GPCRs mGlu1 (ref. ??; PDB
code 4OR2) and mGlu5 (ref. 7°; PDB code 4OO9) (g), and the class F GPCR
Smoothened (ref. 58; PDB code 4JKV) (h). In each panel, the Cα trace of the GABAB1b
linker and transmembrane domain is shown in two orthogonal views in grey,
and the superimposed GABAB ligands PE 38:5 and PC 38:2 are shown as blue and
green stick models, respectively. Various GPCRs were overlapped onto the
transmembrane domain of GABAB1b to bring their bound ligands into
superposition. i, j, Schematic diagrams of the specific contacts between
GABAB1b and PE 38:5 (i), and between GABAB2 and PC 38:2 (j). Selected contacts
between residues and phospholipids are highlighted; hydrogen bonds, red
dotted lines; hydrophobic contacts, black wiggled lines; polar interactions,
green curved lines; pi-stacking interactions, orange box wave. Red zigzags
indicate contacts that are mediated by main-chain atoms. Lipid-interacting
residues that are conserved in the two subunits are highlighted in bold and
include: (1) head group (GABAgy,, His 643 and Arg 549°"; GABA,,, His 647 and 
Arg 556°”); (2) 20-carbon fatty acyl chain (GABAg,,, Phe 557°**°, Tyr 657° and 
Ala703°**; GABAgp, Tyr 564°*°, Tyr 661° and Ala 707°); (3) 18-carbon fatty 
acyl chain (GABA gy, Ile 724”*°; GABAg,, Ile728”*°). 
Extended Data Fig. 8 | Endogenous phospholipid interactions with GABAB
receptor. a, c, e, Orthogonal views of a potential access channel in GABAB1b,
GABAB2 and S1P1 in molecular surface representation, along with phospholipids
PE 38:5, PC 38:2 or the sphingolipid mimic ML056, in space-filling
representation. Side views (left) show an opening between helices TM5 and
TM6 in GABAB1b (a) and GABAB2 (c), and between TM1 and TM7 in S1P1 (e), while
top views (right) highlight the blocked entrance to the lipid-binding pocket
from the extracellular side. In all cases, ECL1 and ECL3 (orange), ECL2 (pale
brown), the linker of GABAB-receptor subunits (pink) and the N-terminal helix
of S1P1 (pink) are distinguished by colour. b, d, f, The same information as in a, c,
e but with ribbon models for GABAB1b (b), GABAB2 (d), and S1P1 (ref. 7*) (PDB code
3V2Y) (f). Lipids are in stick format. g, h, Functional effect of a GABAB2
lipid-binding site mutation. Basal activity (g) and dose-dependent
baclofen-stimulated receptor response (h) in cells transiently expressing Gαqi5 (Gqi) with
WT GABAB receptor or WT GABAB1b and mutant GABAB2 R714A (R714A)
heterodimer. Cells transfected with Gαqi5 alone were used as negative controls.
Relative activity in both graphs was measured by IP1 accumulation, and
expressed as a percentage of the maximum wild-type activity induced by
baclofen relative to the activity of Gαqi5 alone. Data points represent
averages ± s.e.m. of multiple experiments (n), each with quadruplicate
measurements. **P = 0.0016; one-way ANOVA with Bonferroni's post hoc test
was used to calculate statistical differences in basal activity (g). Cell surface
expression level was 77% for the WT/R714A mutant in comparison with the
WT/WT heterodimer.
Extended Data Fig. 9 | Comparison of the GABAB transmembrane domain
with other GPCRs. a, Superposition of GABAB1b and GABAB2 subunits based on
their VFT modules. The short purple line denotes the axis of rotation that
relates the linker and transmembrane domains of GABAB1b and GABAB2 (rotation
χ = 23°; screw translation τχ = 0.01 Å). b, Superposition of the linker and
transmembrane domains of the GABAB1b and GABAB2 subunits. c, Superposition
of the linker and transmembrane domains of each GABAB subunit with the class
C GPCR mGlu5 (ref. 7°; PDB code 6N52) in three different views, with arrows
revealing inward extracellular shifts in TM5 and TM7, as well as an inward
intracellular shift in TM3, in mGlu5 compared with either GABAB subunit.
d-f, Superposition of the transmembrane helices of each GABAB subunit with
the class A GPCR rhodopsin” (PDB code 1F88) (d), the class B GPCR CRF1 (ref. 57;
PDB code 4K5Y) (e), and the class F GPCR Smoothened (ref. 58; PDB code 4JKV) (f).
Arrows indicate large shifts in transmembrane helix positions between GABAB
subunits and other GPCRs, such as outward extracellular movements in class A
(TM2 and TM6), class B (TM1, TM2 and TM7), and class F (TM2, TM4 and TM5).
There are also inward intracellular shifts in TM7 in all comparisons, and
outward intracellular shifts in TM5 in classes A, B and F.
include the conserved Lys 7397 of GABAB1b and Lys 743”* of GABAB2, which
interact with the ‘ionic locks’ through Asn?*° (GABAB1b N513?°; GABAB2 N5202°)
and a serine (GABAB1b S508; GABAB2 S515) in ICL1. c, The ‘ionic lock’ and NPxxY

participating residues of the ‘ionic lock’ are denoted by red dotted lines with
distances labelled. Distances between the Cα atoms of the ‘ionic lock’ residues
are marked by brown dotted lines.
ICP-MS: Syngistix v1.1.
Cell-based functional assay data collection: PHERAstar FS v3.10 R3;
MALDI-MS: FlexControl 3.4.135.


Data analysis Cryo-EM: MotionCor2 v2.1, Gctf v1.06, Relion 3.0, cryoSPARC v2.4, UCSF Chimera v1.13.1; 
Model building and refinement: COOT v0.8.9.0, Phenix v1.14, Molprobity, EMRinger; 
Structural alignment: LSQMAN v9.7.9; 
Figures: PyMOL v1.8.6.0 and v2.3.1, UCSF Chimera v1.13.1, UCSF ChimeraX v0.9;
Cell-based functional assay data analysis: PRISM v8.2.0; 
SPA binding data analysis: SigmaPlot v13.0; 
LC-MS/MS: MS-DIAL v3.40, LipidBlast v49. 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability

Data Bank under accession code EMD-21685. Atomic coordinates for the GABA(B) receptor structure are deposited in the RCSB Protein Data Bank under the
accession number 6WIV.
Sample size No sample size calculation was performed.
Cell-based IP accumulation data represent the average of multiple experiments (n = 3-15), each consisting of quadruplicate measurements.
ICP-MS data represent the average of eight measurements within two experiments.


Data exclusions No data were excluded from analysis.


Replication Cell-based IP accumulation experiments were repeated three to fifteen times.
ICP-MS experiments were repeated twice.


Randomization Randomization was not relevant to the experiments in our study. 


Blinding Blinding was not relevant to the experiments in our study since no subjective allocation was involved. 
Antibodies used 1. Mouse monoclonal ANTI-FLAG® M2 Affinity Gel, MilliporeSigma A2220.
2. Mouse monoclonal ANTI-FLAG® M1 antibody, MilliporeSigma F3040, clone M1, 1:300 dilution.
3. IRDye® 800CW Goat anti-Mouse IgG Secondary Antibody, Li-Cor Biosciences 926-32210, 1:4,000 dilution.


Validation 1. Mouse monoclonal ANTI-FLAG® M2 Affinity Gel: Validation by MilliporeSigma; application included purification of FLAG fusion
proteins.
2. Mouse monoclonal ANTI-FLAG® M1 antibody: Validation by MilliporeSigma for detection of proteins with an N-terminal FLAG
marker; application included immunocytochemistry (on-cell western).
Makaia M. Papasergi-Scott, Michael J. Robertson, Alpay B. Seven, Ouliana Panova,
Jesper M. Mathiesen & Georgios Skiniotis


Stimulation of the metabotropic GABAB receptor by γ-aminobutyric acid (GABA)
results in prolonged inhibition of neurotransmission, which is central to brain
physiology’. GABAB belongs to family C of the G-protein-coupled receptors, which
operate as dimers to transform synaptic neurotransmitter signals into a cellular
response through the binding and activation of heterotrimeric G proteins”*. However,
GABAB is unique in its function as an obligate heterodimer in which agonist binding
and G-protein activation take place on distinct subunits*”. Here we present
cryo-electron microscopy structures of heterodimeric and homodimeric full-length
GABAB receptors. Complemented by cellular signalling assays and atomistic
simulations, these structures reveal that extracellular loop 2 (ECL2) of GABAB has an
essential role in relaying structural transitions by ordering the linker that connects the
extracellular ligand-binding domain to the transmembrane region. Furthermore, the
ECL2 of each of the subunits of GABAB caps and interacts with the hydrophilic head of
a phospholipid that occupies the extracellular half of the transmembrane domain,
thereby providing a potentially crucial link between ligand binding and the receptor
core that engages G proteins. These results provide a starting framework through
which to decipher the mechanistic modes of signal transduction mediated by GABAB
dimers, and have important implications for rational drug design that targets these
receptors.


The neurotransmitter GABA is primarily responsible for synaptic
inhibition throughout the nervous system, via activation of GABAA
ion channels and pre- and postsynaptic GABAB receptors. GABAB
stimulation of the Gi/o class of heterotrimeric G proteins results in a
prolonged decrease in neuronal excitability via the inhibition of
adenylyl cyclase and voltage-gated Ca2+ channels, as well as the opening of
G-protein-coupled inward-rectifying potassium channels?**. Abnormal
execution of GABAB signalling is responsible for several neuropsychiatric
diseases, and the receptor is an attractive drug target for a range of
disorders, including drug addiction, pain, epilepsy, spasticity, anxiety
and gastro-oesophageal reflux disease”®.

There are about 20 G-protein-coupled receptors (GPCRs) in family C,
including GABAB, the metabotropic glutamate receptors (mGlu1
to mGlu8), the calcium-sensing receptor, two taste receptors and
several orphan receptors®”’. These receptors operate as obligate
dimers, in which each subunit is composed of a bilobed extracellular
ligand-binding domain (termed the Venus flytrap (VFT)) and a
7-transmembrane domain (7TM) that are connected via a linker region”.
Previous crystallographic studies of the VFT from various members
of family C (including GABAB) have shown that agonist binding
rearranges the VFT in a way that would bring the linker regions into close
proximity”. All GPCRs in family C contain a cysteine-rich domain
within the linker region, with the exception of GABAB, which has a
relatively short linker. Recent structures of mGlu5 have shown that
the cysteine-rich domain interacts with ECL2 of the transmembrane
region, thereby transducing conformational changes 120 Å away from
the ligand-binding site on the VFT to the 7TM (where G-protein
activation occurs)". Given that GABAB lacks a cysteine-rich domain, the
structural communication between an agonist-bound VFT and the
7TM remains unclear.

Furthermore, unlike other GPCRs in family C, GABAB is an obligate
heterodimer of dissimilar subunits, GABAB1 and GABAB2”. Agonist
binding occurs only on the VFT of GABAB1, and G-protein coupling and
activation occurs exclusively through GABAB2*”. Therefore, besides its
pharmacological interest, GABAB presents an ideal model system to
study trans-activation mechanisms of dimeric family C GPCRs. However,
such studies have been limited by the lack of structural information
on full-length GABAB, restricting our ability to understand
how agonist binding to GABAB1 results in G-protein activation on the
intracellular side of GABAB2. Here we use single-particle cryo-electron
microscopy (cryo-EM), atomistic simulations and cellular signalling
assays to obtain structural and mechanistic insights into full-length
GABAB receptor complexes.

For structural studies, we co-expressed recombinant constructs
of human GABAB1 and GABAB2 in insect cells and purified the GABAB
heterodimer by tandem affinity chromatography in the presence of
the inverse agonist CGP55845"°. We determined the structure of the
inactive detergent-solubilized GABAB heterodimer at a global indicated
resolution of 3.6 Å (Fig. 1, Extended Data Figs. 1, 2a, Extended
Data Table 1). Our cryo-EM map resolved the entire GABAB apart from
the C-terminal coiled-coil domain, which appears to be flexible in relation
to the transmembrane regions. Additionally, our focused map
Fig. 1 | Cryo-EM map and model of the full-length GABAB receptor
heterodimer in inactive state. a, b, GABAB receptor cryo-EM map (a) and
ribbon diagram (b) (GABAB1, teal; GABAB2, tan; GDN micelle, white; CGP55845,
gold; phospholipids, orange; glycosylation, blue and pink). Elongated densities
at the 7TM interface are probably lipid or detergent molecules (yellow).

c, GABAF,,,, increases with higher levels of cotransfected GABA,, and GABA, 
plasmid DNA. d, ECL2 truncation of either subunit of GABA, increases F,,,,, of 
GABA-stimulated response when compared to cotransfected wild-type (WT) 
receptors of similar surface expression (Extended Data Fig. 6a); ECL2 
shortening of both protomers inhibits activity. For sample sizes, replicability, 
sample collection and descriptions of error bars for all figures, see ‘Statistics 
and reproducibility’ in Methods. 


of the extracellular region at a resolution of 3.5 Å provided improved
density for the linker regions, and assisted with modelling (Extended
Data Fig. 1). The asymmetric protomers (GABAB1 and GABAB2) share a
similar secondary structure and arrangement, but are distinguished
by differential glycosylation and a well-resolved ligand density within
the GABAB1 VFT that is absent in GABAB2 (Fig. 1, Extended Data Figs. 3,
4a). The upper VFT lobes of GABAB form a junction, whereas the lower
VFT lobes are separated by about 20 Å (Fig. 1, Extended Data Fig. 3). The
CGP55845 ligand adopts a horseshoe-like conformation (which was
confirmed by GemSpot”) that closely resembles the crystal structure of
the GABAB1 VFT in complex with the inhibitor CGP54626”"* (Extended
Data Fig. 3). We also observe a small spherical density in GABAB1 that
appears to be interacting with the backbone carbonyl of G277, in addition
to several anionic groups and a tyrosine, which raises the possibility
that a divalent cation resides in that location (Extended Data Fig. 3e).

Inactive GABA_B assumes an overall morphology similar to that of
the apo-state structure of mGlu5; the most substantial differences
between these two arise within the linker region (Fig. 1b). Bridging the
VFT and 7TM, an approximately 20-residue linker forms a β-sheet in
conjunction with the ECL2. Notably, the length of the GABA_B receptor
ECL2 is nearly twice that of mGlu5 (Fig. 1, Extended Data Fig. 4b). In the
absence of cysteine-rich domains, the β-sheet structure of the GABA_B
linker in complex with ECL2 orders this region, with the additional
ECL2 length thus allowing coupling between the 7TM and VFT for signal
transduction. Our molecular dynamics simulations support the
structural stabilization of the linker through the β-sheet formation.
After 200 ns, in all simulations the linker and ECL2 continued to adopt 
a stable structure even in the absence of the VFTs, although we did

Fig. 2 | TM3 and TM5 stabilize an inactive-state dimer interface of GABA_B
7TM. a, Inactive-state GABA_B 7TMs (GABA_B1b, teal; GABA_B2, tan) are in closer
proximity than those of inactive mGlu5 (purple) (PDB 6N52), top-down view.
b, The GABA_B1b and GABA_B2 interface is stabilized through hydrophobic
interactions along the TM5 helices, and by polar residues on the intracellular
side of TM3 and TM5 (boxed). c, Polar interaction residues from the
intracellular side of the receptor. GABA_B2 E677 is shown for perspective;
however, the map density for this residue was insufficient for high-confidence
modelling of its side chain. IP1 accumulation assays illustrate differences in the
basal activity (d) and GABA response (e) of GABA_B TM3 and TM5 mutants.


observe a slight downward rotation in β-sheet orientation (Extended
Data Fig. 5). To further examine the involvement of ECL2 in receptor 
activation, we used a functional assay with a chimeric Ga,,,, in which 
the G,,-coupled GABA, receptor can couple to the PLC pathway*”’. We 
measured the accumulation of the downstream metabolite inositol
monophosphate (IP1) by LiCl using an established assay, and thereby
monitored the GABA-stimulated and basal receptor activity (Fig. 1c).
ECL2 shortening was generally inhibitory to the proper membrane
trafficking of the subunits of GABA_B. Accordingly, we normalized the
transfected DNA to obtain similar expression levels between constructs
(Extended Data Fig. 6a). The partial deletion of the ECL2 of
GABA_B1b (residues 627–634), comprising the unstructured tip of the
loop nearest the VFT, produced an increase in basal activity, but did
not affect the GABA maximal response (E_max) when expressed with
wild-type GABA_B2 (Fig. 1d). These findings indicate that abrogating the
ECL2-linker allows flexibility in the GABA,, VFT relative to the rest of 
the receptor dimer, resulting in activation through the GABA,, VFT- 
7TM route in the absence of agonist”°. By contrast, the partial deletion 
of the ECL2 of GABA_B2 (residues 631–638) did not affect basal activity,
but produced an increase in GABA E_max and a decrease in GABA potency
when compared to wild-type receptor expressed at similar levels at
the cell surface, indicating that the ECL2 of GABA_B2 may be partially
inhibitory (Fig. 1d, Extended Data Fig. 6a). When both receptors contain
a truncated ECL2, we observed a decrease in GABA E_max, which suggests
that at least one VFT must be structurally coupled through the
extended ECL2-linker for full activation (Fig. 1d). Collectively, these
data support a bimodal transactivation mechanism of GABA_B, in which
agonist binding on the GABA, receptor can proceed from the GABA;, 
VFT down to the GABA,, 7TM region to enact changes in GABA,, that 
promote G-protein activation, and also activate the GABA;, 7TM directly 
through the GABA,, VFT. 

Besides those of the VFT and the C-terminal coiled-coil, we observe
that inactive GABA_B forms an additional dimer interface between
transmembrane (TM)3 and TM5 from each monomer of the heterodimer
(Figs. 1, 2). The interface is formed by ionic interactions between residues
on the intracellular side of each receptor (H572 in TM3 and E673
in TM5 of GABA_B1b, and H579 in TM3 and E677 in TM5 of GABA_B2) and is

Fig. 3 | Phospholipid binds within the transmembrane cores of GABA_B.

a, Electron microscopy map clipped to show the location of phospholipid 
within GABA,, of the heterodimer. b,c, Ribbon representation of 
phosphatidylethanolamine within GABAg, (b), with boxed region (c) to show 
the structural environment of the headgroup. d, GABA concentration response 
for wild-type GABA_B (red), and mutants designed to displace phospholipid


further stabilized through aromatic residues along the same helices
(Fig. 2b, c). We also observed elongated densities, probably corresponding
to glyco-diosgenin (GDN) and/or cholesteryl hemisuccinate (which
were used in purification), packed between the extracellular sides of
the two 7TM regions, which may also stabilize the interface (Fig. 1a,
Extended Data Fig. 2i). The dimer interface contrasts with that of the
inactive mGlu5 receptor, in which a distance of about 16 Å separates the
two 7TM regions (Fig. 2a). To probe the importance of these interactions,
we mutated the TM3–TM5 interface and performed IP1 accumulation
assays (Fig. 2d, e, Extended Data Fig. 6). Notably, mutation of
either H579 or E677 on GABA_B2 to alanine, or mutation of both E673 and
H572 on GABA_B1b to alanine, increased the basal activity of the receptor.
This suggests that the TM3–TM5 interface is inhibitory to signalling
in the absence of agonist, and further supports the transactivation
mechanism described above (Fig. 2, Extended Data Fig. 6b–e). When
expressed at the cell surface alone, GABA_B2(H579A/E677A) exhibited
a slight increase in basal activity compared to GABA_B2 alone (Extended
Data Fig. 6d). These findings further support a role of an intraprotomer
H579–E677 interaction in GABA_B2 in stabilizing an auto-inhibited
state, and are consistent with the lack of constitutive activity by the
GABA_B2(Δ631–638) mutant.

The most unexpected observation in the transmembrane region of 
both monomers of GABA_B is a wishbone-shaped density that occupies
the extracellular half of each 7TM core. The bifurcated density, which 
is better resolved within the GABAg, helical bundle, corresponds toa 
phospholipid (Fig. 3, Extended Data Fig. 2). Our consideration of the 
size and shape of the density, the surrounding amino acid environ- 
ment within the 7TM core and the known phospholipid composition 
in Spodoptera frugiperda (Sf9) insect cells led us to infer that the den- 
sity in GABAg,, and probably GABA,,, corresponds to phosphatidy- 
lethanolamine (Fig. 3, Extended Data Fig. 7). However, it is possible 
that the observed density is the result of a heterogeneous mixture 
of phospholipids. The observation of the phospholipid is particularly
intriguing because, to our knowledge, no other GPCR structure has
revealed a two-chained phospholipid within this space. Although a
lipid-activated subfamily within family A GPCRs exists, known ligands
are single-acyl-chain lipids, eicosanoids and sterols”. Notably, GABA, 
residues in the extracellular loops (including ECL2) and TM3 of GABA,

(green) or de-stabilize the phospholipid headgroup (blue). e, f, Molecular
dynamics simulations show a collapse of the transmembrane cavity when lipid 
is absent. e, Violin plot of cavity-volume ensemble data. f, Representative 
top-down view of the GABA, ribbon and stick model from molecular dynamics 
simulations at 200 ns. 


coordinate the polar headgroup of the phospholipid, resulting in a ‘lid’
over the 7TM cavity that resembles the lipid-binding GPCRs of family A. 
The remaining hydrophilic atoms in the lipid appear solvent-exposed, 
whereas the lipid tails are buried deep into the hydrophobic portion of 
the transmembrane cavity (Fig. 3b, c). The presence of the 7TM lipid
in our cryo-EM structures, together with its retention during detergent
solubilization of the receptor, suggests that it is bound tightly.
Additionally, a conserved TM6 tryptophan, which acts as an activating
‘toggle-switch’ in other GPCRs of family C, is replaced by cysteine in
both GABA_B1b and GABA_B2 (Extended Data Fig. 4c, d). This replacement is
essential, as any probable tryptophan rotamer at that position would 
clash sterically with the bound phospholipid. 

The presence of lipid in the 7TM core suggests this lipid may have 
a physiological role in the structural and functional integrity of the 
transmembrane bundles, occupying a region that corresponds to the 
ligand-binding site in other classes of GPCR (Extended Data Fig. 4). 
Moreover, given the role of the ECL2 in coupling the VFT to the 7TM 
regions, it is conceivable that interactions of the phospholipid with 
the ECL2 are essential for increasing the stability of the linker region 
and also participate in relaying structural transitions from the VFT to 
the 7TM core. Mutations designed to displace lipid tails in the 7TM
core (GABA_B1b(L553W) and GABA_B2(L560W)) decrease basal activity
and potency of GABA while increasing GABA E_max, whereas mutation
of the arginine that coordinates the headgroup of the phospholipids
(GABA_B1b(R549A) and GABA_B2(R556A)) results in increases in both
GABA_B basal activity and receptor response to GABA (Fig. 3d, Extended
Data Fig. 6f, g). Collectively, these results indicate that destabilization
of phospholipid in the core affects receptor activation, and it is thus
conceivable that the lipid directly modulates receptor activity as a
result of ECL2 movements in native GABA_B. To further probe the role
of phospholipid, we used a range of atomistic simulations with the
GABA_B receptor system. We initiated simulations of the GABA_B 7TM
and linker with the VFT removed, both with and without lipid. The 
simulations revealed that without lipid the transmembrane helices 
begin to collapse into the core, with mean decreases in cavity volume 
of about 30% (Fig. 3, Extended Data Fig. 5, Supplementary Data 1, 2); 
this suggests a strongly coupled interplay between the phospholipid 
and 7TM core. Alternatively, multiple simulations that started without

Fig. 4 | Inhibitor-bound GABA_B1b homodimers adopt a VFT orientation
similar to that of active GABA_B heterodimer, and a 7TM interface similar to
that of active mGlu5. a, VFT overlay of structures of GABA_B heterodimer
(tan and teal) in inactive (top) and active (bottom) state (PDB 4MS4), with the
model of GABA_B1b homodimer (blue). The upper-lobe interaction interface
(boxed) of the GABA_B1b homodimer matches that of the active-state


lipid in the transmembrane helices showed lipid tails from the bilayer 
entering the receptor hydrophobic cavity below residue Y661, at a 
site akin to that at which the tail protrudes in our cryo-EM structure. 
Peripheral lipid-tail insertion in simulations correlated with circumvention
of the collapse of the 7TM cavity (Extended Data Fig. 5). In one
simulation, after 200 ns we observed a lipid that had inserted both of
its hydrocarbon tails into the receptor core. Extending the simulation
by an additional 100 ns revealed the headgroup had moved over
the top of the receptor, indicating lipid entry may occur as a stepwise
process in which the lipid tail enters by sliding between TM5 and TM6,
and is followed by the entire lipid (Extended Data Fig. 5). Although it 
is probable that lipid insertion into the 7TM core occurs concurrently 
with helix insertion into the membrane during protein folding, these 
results suggest a clear tendency for phospholipids to insert at that 
position in a mature receptor. 

Pharmaceutical agents targeting the GABA_B receptor and other
GPCRs of family C are proposed to function either at the orthosteric
ligand-binding site or allosterically within the transmembrane core, as
with orthosteric ligands of family A GPCRs. However, the bound phospholipid
appears to occlude the binding of allosteric modulators analogous
to those used to target other family C receptors. Consequently,
potential allosteric modulators of GABA_B would need to either displace
the core lipid or bind at an alternative site peripheral to the bundle.

Although we obtained pure GABA_B heterodimer by tandem affinity
purification of receptor constructs with distinct tags, when both subunits
contain an N-terminal Flag tag we also purified a considerable fraction
(>40%) of GABA_B1b homodimers. In cells, an endoplasmic-reticulum
retention signal within the GABA_B1 coiled-coil domain prevents GABA_B1
from reaching the plasma membrane, unless this domain is masked
by the GABA_B2 C terminus or other interacting partners. Thus, the
presence of GABA_B1b homodimer in our preparation is probably due
to the persistence of internal membranes during the purification
procedure. However, multiple studies have previously suggested a
physiological role for GABA_B1 homodimers within some cell types of
the nervous system and gastrointestinal tract that express GABA_B1
isoforms in the absence of GABA_B2. Although it is as yet unclear how
homodimeric GABA_B1 functions in the absence of GABA_B2 G-protein
coupling, we sought to obtain further mechanistic insights into the
GABA_B system and obtained the structure of the GABA_B1b homodimer




heterodimer. b, The GABA_B1b subunit of the heterodimer (teal) and homodimer
(blue) were aligned to the 7TM, revealing differences in VFT and linker
positioning. c, Top-down view of superposed 7TM of GABA_B heterodimer
(tan and teal) and homodimer (blue), showing the marked difference in the
protomer interface. d, Top-down view of superimposed 7TM of agonist-bound
mGlu5 (purple) (PDB 6N51) and GABA_B1b homodimer (blue).


at a global indicated resolution of 3.2 Å (Fig. 4, Extended Data Figs. 2,
8, Extended Data Table 1). 

The structure of the GABA_B1b homodimer shows that both of the VFTs
are liganded, and that the monomers assume a roughly two-fold symmetric
arrangement. The individual lobes of each VFT are in an open
conformation, and are superimposable with the GABA_B1b subunit VFT
in the inactive heterodimer (Fig. 4b). Despite the conformation of the
individual protomers, the VFT homodimer adopts the same overall
conformation as the crystal structure of agonist-bound GABA_B VFT (Protein
Data Bank (PDB) code 4MS4), owing to an almost-identical upper
lobe interaction interface (Fig. 4b). Correspondingly, the lower lobes
of the receptor are brought into proximity and the receptor adopts
a more-compact state that shortens the distance between the VFTs
and the 7TMs (Fig. 4). Compared to the heterodimer, the 7TMs in the
GABA_B1b homodimer are rotated to form a TM6–TM6 interface (Fig. 4c).
This is consistent with previous cross-linking experiments that identified
a TM6–TM6 interaction upon agonist-induced stimulation of the
heterodimer, and agrees with structural rearrangements of mGlu5
upon stimulation with agonist (Fig. 4d). Furthermore, the relative
arrangement of the VFT and transmembrane regions is notably similar
to that of the activated state of near-full-length mGlu5 (Extended
Data Fig. 8). Thus, the conformation observed in the GABA_B1b homodimer
may provide valuable hints about the architecture of the active-state
GABA_B receptor (Fig. 4). Previous GABA_B chimaera studies have shown
that replacement of one of the 7TMs of the GABA_B1 homodimer with that
of the GABA_B2 receptor (thus enabling G-protein coupling) produces
a constitutively active receptor that is not additionally stimulated by
GABA. Hence, the relative positioning of the VFT, as observed in the
structure of the GABA_B1b homodimer, appears to be sufficient to stabilize
an active conformation, presumably through the linkage of VFT to
7TM to reorient the helical interface. Because GABA_B1b in the homodimer
is bound to an inverse agonist, the adoption of an ‘active-like’ global
conformation of the VFT must be irrespective of the ligand, probably
owing to differences in the interaction interface of the VFT N-terminal
lobes in comparison to the same interface in the heterodimer.

Collectively, our structures, cellular signalling assays and atomistic
simulations provide crucial insights into mechanistic aspects of
GABA_B signalling. The extended ECL2 and its interaction with the linker
region appear to compensate for the lack of a cysteine-rich domain in
GABA_B compared to all other family C receptors, and thus also transduce
conformational changes from the VFT to the 7TM regions. The
phospholipid that occupies the 7TM core is important for the integrity
of the transmembrane domains in both subunits of GABA_B, while also
structurally coordinating the critical ECL2 region. Our findings support
a model in which agonist binding to GABA_B1b results in VFT dimer
compaction that reorients the protomers via the linker-ECL2. Such a
conformational change would drive the 7TMs to twist away from an
auto-inhibited state mediated by the inactive TM3–TM5 interface,
thereby forming a new TM6 helical interface. Although currently
unclear, the activating transitions within the GABA_B2 7TM are probably
mediated both through the newly formed TM6–TM6 interface
and propagation through its own ECL2, as supported by our functional
assays. An intriguing possibility is that the phospholipid may act as a
sensor of changes in VFT and ECL2 conformations that would result in
activating transitions within the 7TM of GABA_B2 to prime it for G-protein
engagement. Addressing these questions will require detailed structural
studies of the active-state heterodimer alone, and in complex with
a G protein engaging the receptor core. The present work, along with
recent studies on mGlu5, forms a starting structural framework for
deciphering the signal transduction mechanism of GABA_B and family C
GPCRs in the context of full-length receptors.

Methods


No statistical methods were used to predetermine sample size. The 
experiments were not randomized and investigators were not blinded 
to allocation during experiments and outcome assessment. 


Cloning 

The cDNA clone for human GABA_B2 receptor (accession NM_005458)
in pcDNA3.1+ was obtained from the cDNA Resource Center (www.cdna.org);
and the cDNA clone for human GABA_B1b was purchased
from Horizon Discovery (accession BC050532, clone ID 5732186).
Primers were designed to include a haemagglutinin (HA) signal
sequence in the place of authentic signal sequences of each receptor,
thus removing the first 29 residues of GABA_B1b and 41 residues of
GABA_B2. Both the authentic signal sequence and the HA sequence are
cleaved during processing; therefore, the substitution does not result
in a change of sequence in the mature receptor. For purification of
receptors for cryo-EM studies, GABA_B constructs were subcloned into
the pFastBacDual vector (Invitrogen) with N-terminal Flag epitope
(DYKDDDD) following the HA signal sequence and/or C-terminal
hexa-histidine (His6) tags, so that the following constructs were produced:
HA-Flag-GABA_B1b(30–844)-His6, HA-GABA_B1b(30–844)-His6
and HA-Flag-GABA_B2(41–941). For signalling assays, the
HA-Flag-GABA_B1b(30–844)-His6 and a HA-HA-GABA_B2(41–941) construct (in
which the Flag tag was replaced by the HA epitope tag YPYDVPDYA)
were subcloned into pcDNA3.1(+). The primers (Integrated DNA
Technologies) used to subclone GABA_B included: EcoRI-HA-Flag-GABA_B1b,
5′-GCGCGCGAATTCATGAAGACGATCATCGCCCTGAGCTACATCTTCTGCCTGGTGTTCGCCGATTACAAGGACGACGATGACAAGTCCACTCCCCCCATCTCCCG-3′;
GABA_B1b-His6-SalI,
5′-GCGCGCGTCGACTTAATGATGATGATGATGGTGCTTATAAAGCAAATGCAC-3′;
GABA_B1b-SalI,
5′-GCGCGCGTCGACTTACTTATAAAGCAAATGCAC-3′;
EcoRI-HA-Flag-GABA_B2,
5′-GCGCGCGAATTCATGAAGACGATCATCGCCCTGAGCTACATCTTCTGCCTGGTGTTCGCCGATTACAAGGACGACGATGACAAGTGGGCGCGGGGCGCCCCC-3′;
GABA_B2-His6-SalI,
5′-GCGCGCGTCGACTTAATGATGATGATGATGGTGCAGGCCCGAGACCATGAC-3′;
and GABA_B2-SalI,
5′-GCGCGCGTCGACTTACAGGCCCGAGACCATGAC-3′. To generate
GABA_B mutants, PCR reactions were conducted with either PfuTurbo
or Q5 polymerase using the following pairs of primers and
pcDNA3.1+neo containing either HA-Flag-GABA_B1b or HA-HA-GABA_B2:
GABA_B1b(H572A), 5′-GGTGGGTCGCCACGGTCTTC-3′ and
5′-GAAGACCGTGGCGACCCACC-3′;
GABA_B1b(E673A), 5′-CTTGCTTATGCTACCAAGAG-3′ and
5′-CTCTTGGTAGCATAAGCAAG-3′;
GABA_B2(H579A), 5′-CTGGAGAGTCGCTGCCATCTTCAA-3′ and
5′-TTGAAGATGGCAGCGACTCTCCAG-3′;
GABA_B2(E677A), 5′-CTTAGCTTGGGCTACCCGCAAC-3′ and
5′-GTTGCGGGTAGCCCAAGCTAAG-3′;
GABA_B1b(Δ627–634), 5′-CTTGGCAAATGTCTCAATGGTC-3′ and
5′-GTCTCTATTCTGCCCCAGC-3′;
GABA_B2(Δ631–638), 5′-ATCTCCATCCGCCCTCTCC-3′ and
5′-CATGCTGTACTTCTCCACTG-3′;
GABA_B1b(R549A), 5′-CTGCCAGGCCGCCCTCTGGCTCCTG-3′ and
5′-ACGAAAGGGAACTGG-3′;
GABA_B2(R556A), 5′-GCACCGTCGCTACCTGGATTCTC-3′ and
5′-AAAGTGTTTCAAAGG-3′;
GABA_B1b(L553W), 5′-CTCTGGCTCTGGGGCCTGGGCTTTAG-3′ and
5′-GCGGGCCTGGCAGACG-3′;
GABA_B2(L560W), 5′-GGACCTGGATTTGGACCGTGGGCTAC-3′ and
5′-TGACGGTGCAAAGTG-3′.
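Several of the point-mutation primer pairs listed above appear to be exact reverse complements of one another, as is usual for overlapping site-directed mutagenesis primers. The following minimal Python sketch checks this for three of the pairs; the helper names (`reverse_complement`, `is_complementary_pair`) are hypothetical and are not part of the published protocol, and the sequences are copied from the list above with extraction spaces removed.

```python
# Illustrative check only; function names are assumptions, not the authors' tooling.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def is_complementary_pair(forward: str, reverse: str) -> bool:
    """True if the reverse primer is the exact reverse complement of the forward primer."""
    return reverse_complement(forward) == reverse.upper()


# Forward/reverse primers (5'->3') for three point mutants from the Cloning section.
PRIMER_PAIRS = {
    "H572A": ("GGTGGGTCGCCACGGTCTTC", "GAAGACCGTGGCGACCCACC"),
    "E673A": ("CTTGCTTATGCTACCAAGAG", "CTCTTGGTAGCATAAGCAAG"),
    "H579A": ("CTGGAGAGTCGCTGCCATCTTCAA", "TTGAAGATGGCAGCGACTCTCCAG"),
}

if __name__ == "__main__":
    for name, (fwd, rev) in PRIMER_PAIRS.items():
        print(name, is_complementary_pair(fwd, rev))
```

The deletion and some substitution primer pairs in the list are not fully overlapping, so this check is expected to apply only to pairs designed as complementary oligonucleotides.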


Expression and purification 

Sf9 insect cells (Expression Systems) were coinfected at a density of
about 2.0 × 10^6 cells/ml with HA-Flag-GABA_B2 baculovirus and either
HA-Flag-GABA_B1b-His6 or HA-GABA_B1b-His6 baculovirus at a multiplicity
of infection between 3.0 and 5.0. During expression, cells were treated
with 5 µM CGP55845 (Hello Bio). At 48 h post-infection, cells were
collected by centrifugation, washed once with phosphate-buffered
saline containing protease inhibitors (leupeptin, soybean trypsin
inhibitor, N-p-tosyl-L-phenylalanine chloromethyl ketone,
tosyl-L-lysyl-chloromethane hydrochloride, phenylmethylsulfonyl
fluoride, aprotinin, bestatin and pepstatin) and 5 µM CGP55845. Cell
lysis was achieved through nitrogen cavitation in buffer containing
20 mM HEPES, pH 7.5, 150 mM NaCl, 1 mM EDTA, 10 µM CGP55845, 2 mM
MgCl2, nuclease and protease inhibitors. The whole-cell lysate was
centrifuged at 1,000g to remove nuclei and unbroken cells. The supernatant
was centrifuged at 100,000g to isolate the membrane fraction.
Membranes were resuspended by Dounce homogenization in buffer
containing 20 mM HEPES, pH 7.5, 150 mM NaCl, 2 mM MgCl2, 1 mM EDTA,
2 mg/ml iodoacetamide, 10 µM CGP55845, 1% n-dodecyl β-D-maltoside
(DDM), 0.2% sodium cholate, 0.2% cholesterol hemisuccinate (CHS),
nuclease and protease inhibitors. Solubilized membranes were clarified
by centrifugation at 100,000g, and the supernatant was loaded
onto a pre-equilibrated column of anti-DYKDDDDK G1 affinity resin
(Genscript). The resin was washed with buffer A (20 mM HEPES, pH 7.5,
150 mM NaCl, 10 µM CGP55845 and protease inhibitors) with 0.1% DDM
and 0.02% CHS. Protein was eluted with buffer A containing 0.1% DDM,
0.02% CHS and 0.2 mg/ml DYKDDDDK peptide. The eluate was then
loaded onto a pre-equilibrated Nickel-NTA column. Resin was washed
with buffer A containing 0.1% DDM and 0.02% CHS; and the buffer was
exchanged in six steps to buffer A supplemented with 0.2% GDN, 0.02%
CHS, followed by a two-step exchange into buffer A containing 0.004%
GDN and 0.0004% CHS. Protein was eluted from the Ni-NTA resin with
buffer A containing 0.004% GDN, 0.0004% CHS and 500 mM imidazole.
The resulting eluate was concentrated by centrifugal filtration with a
50-kDa molecular-weight cut off, and subsequently run on a Superose
6 size-exclusion column (GE Healthcare). Samples were pre-screened
for sample quality by negative-stain transmission electron microscopy
and then immediately prepared on cryo-EM grids.


Cryo-EM data collection 
For the GABA_B1b–GABA_B2 heterodimer, 3.5 µl of sample was applied at
a concentration of 3–5 mg/ml to glow-discharged holey carbon grids
(Quantifoil R1.2/1.3). The grids were blotted using an FEI Vitrobot
Mark IV (Thermo Fisher Scientific) at 18 °C and 100% humidity, and
plunge-frozen into liquid ethane. Two datasets were used to produce
the final structure. For both data collections, cryo-EM imaging was
performed on a Titan Krios (Thermo Fisher Scientific) electron microscope
equipped with a K3 Summit direct electron detector (Gatan). The
microscope was operated at 300 kV accelerating voltage, at a magnification
of 57,050× in counting mode, resulting in a magnified pixel size of
0.8521 Å. For the first dataset, movies were obtained at an exposure rate
of 14.19 electrons per Å² per s with defocus ranging from −1.5 to −2.7 µm.
The total exposure time was 3.985 s over 57 frames per movie stack. For
the second dataset, movies were obtained at an exposure rate of 21.43
electrons per Å² per s with defocus ranging from −1.2 to −2.5 µm. The
total exposure time was 2.996 s including 50 frames per movie stack.
Cryo-EM grids for the GABA_B1b homodimer at a concentration of
5.0 mg/ml were prepared similarly to the heterodimer. Cryo-EM imaging
was performed on a Titan Krios electron microscope equipped with
a post-column energy filter and a K2 Summit direct electron detector
(Gatan). The microscope was operated at 300 kV accelerating voltage,
at a magnification of 47,198× in counting mode, resulting in a pixel size
of 1.06 Å. Movies were obtained at an exposure rate of 6.212 electrons
per Å² per s with defocus ranging from −0.9 to −2.5 µm. The total exposure
time was 8.0 s over 40 frames per movie stack. Automatic data
acquisition was performed using SerialEM for all datasets.
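The text reports exposure rates and exposure times but not the accumulated dose. As a rough illustration, the sketch below derives the total dose and dose per frame from the reported values, assuming the dose is simply rate × time and is spread evenly over the frames; the class and variable names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Exposure:
    """Exposure settings for one dataset, as reported in the text."""
    rate: float    # electrons per Å^2 per second
    time_s: float  # total exposure time per movie (s)
    frames: int    # frames per movie stack

    @property
    def total_dose(self) -> float:
        """Accumulated dose per movie in electrons per Å^2 (assumes constant rate)."""
        return self.rate * self.time_s

    @property
    def dose_per_frame(self) -> float:
        """Dose per frame, assuming equal exposure for every frame."""
        return self.total_dose / self.frames


DATASETS = {
    "heterodimer dataset 1": Exposure(rate=14.19, time_s=3.985, frames=57),
    "heterodimer dataset 2": Exposure(rate=21.43, time_s=2.996, frames=50),
    "homodimer": Exposure(rate=6.212, time_s=8.0, frames=40),
}

if __name__ == "__main__":
    for name, exp in DATASETS.items():
        print(f"{name}: {exp.total_dose:.1f} e/Å^2 total, "
              f"{exp.dose_per_frame:.2f} e/Å^2 per frame")
```

Under these assumptions the three datasets work out to roughly 57, 64 and 50 electrons per Å² in total; these are derived figures, not values stated by the authors.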


Image processing and 3D reconstructions 

Dose-fractionated image stacks were subjected to beam-induced
motion correction and dose-weighting using MotionCor2. Contrast
transfer function (CTF) parameters for each non-dose weighted micrograph
were determined by Gctf for the homodimer and dataset 1 of the
heterodimer, and by CTFFIND-4.1 for dataset 2 of the heterodimer. For
all datasets, particle selection, 2D and 3D classification were performed
on a binned dataset (pixel size 1.72 Å and 4.24 Å for the heterodimer and
homodimer, respectively) using RELION (versions 3.0 and 3.1). The
two datasets for the heterodimer were processed individually before
being combined following a Bayesian polishing step. A total of 538,957
particles from 1,324 micrographs and 2,062,083 particles from 8,991
micrographs were extracted using semi-automated particle selection
for the heterodimer datasets 1 and 2, respectively. Both particle sets
were then separately subjected to three rounds of 2D classification and
two rounds of 3D classification. Particles in both sets were subjected to
Bayesian polishing individually and then combined for a total of 286,140
particles. The merged dataset was fit for CTF parameters (per particle
defocus and astigmatism, per micrograph B-factor) and estimated
for anisotropic magnification and beam-tilt. A final 3D refinement
was followed by post-processing using a mask that excluded the GDN
micelle density. A focused refinement was also carried out using a mask
encompassing the VFT and linker regions of GABA_B. For the GABA_B1b
homodimer structure, a total of 2,278,113 particles were extracted from
5,602 micrographs using semi-automated particle selection. Particles
were subjected to multiple rounds of 2D and 3D classification, until a
subset of 282,811 particles was selected for the final map. The particle set
underwent multiple rounds of CTF parameter fitting and was subjected
to Bayesian polishing before 3D refinement and post-processing of the
final map. UCSF Chimera was used for map and model visualization.
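To put the classification steps in perspective, the following sketch computes the fraction of initially extracted particles that contributed to each final map, using only the counts reported above. The dictionary layout and function name are illustrative assumptions; the percentages are derived here and are not quoted by the authors.

```python
# Particle bookkeeping from the reported counts; names are hypothetical.
HETERODIMER = {
    "extracted": 538_957 + 2_062_083,   # datasets 1 and 2 combined
    "micrographs": 1_324 + 8_991,
    "final": 286_140,
}
HOMODIMER = {
    "extracted": 2_278_113,
    "micrographs": 5_602,
    "final": 282_811,
}


def retained_fraction(counts: dict) -> float:
    """Fraction of extracted particles that remain in the final particle set."""
    return counts["final"] / counts["extracted"]


if __name__ == "__main__":
    for name, counts in (("heterodimer", HETERODIMER), ("homodimer", HOMODIMER)):
        per_mic = counts["extracted"] / counts["micrographs"]
        print(f"{name}: {retained_fraction(counts):.1%} retained, "
              f"{per_mic:.0f} picks per micrograph")
```

In both cases only a little over one-tenth of the picked particles survive 2D and 3D classification, which is typical of semi-automated picking followed by stringent classification.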


Model building 

The initial model for the VFT was taken from the inactive-state VFT 
crystal structure (PDB 4MR7”) and the initial structure of the trans- 
membrane domain of GABA,;, was generated as ahomology model from 
the inactive cryo-EM structure of mGlu5 (PDB 6N52™) (38% sequence 
similarity to GABA,,) using Schrédinger’s Prime homology model- 
ling**. Both components were placed into the GABA, cryo-EM map 
using ‘fit-in-map’ function of Chimera. The linker, intracellular loops 
and extracellular loops of GABAg, were interactively adjusted into 
the electron microscopy map using Coot (version 0.8.9.1el)* and the 
resulting model of the GABA, linker-7TM was then used to generate 
a homology model of GABAg;, using Schrodinger’s Prime homology 
modelling, which was also placed into the map in Chimera. Iterative 
rounds of interactive model adjustment in Coot followed by real-space 
refinement in Phenix (version 1.17.1-3660)*°, using secondary struc- 
ture restraints in addition to the default restraints, were performed 
to improve the modelling. The resulting structure of GABAg, was suf- 
ficiently different to the original homology model such that that the 
root mean square deviation (r.m.s.d.) between the homology model and
the final model is non-trivial (about 2.0 Å in the transmembrane helices
alone). Once confidence in the side-chain placement was reached for
the ligand-binding cleft on GABA_B1b, the GemSpot pipeline was used
to model the inhibitor, CGP55845, into the map. After further improvement,
1-palmitoyl-2-oleoyl-sn-glycero-3-phosphoethanolamine (POPE)
was modelled in the transmembrane pocket of GABA_B1b and GABA_B2
with GemSpot. Final model refinement was performed with Phenix.


Molecular dynamics simulations and analysis 

To prepare the system for molecular dynamics simulations, the
low-resolution features of the map were used to manually build intracellular
loop 2 (ICL2) into the model of the GABA_B1b–GABA_B2 inactive
heterodimer using Coot. The system was then prepared in Maestro,
version 2019-4 (Schrödinger) to build any stubbed side chains and
determine protonation states. The VFTs were removed from the heterodimer
to produce a truncated construct starting at residues T461
for GABA_B1b and T468 for GABA_B2, thus containing only the linkers and
the transmembrane domains. The Orientations of Proteins in Membranes
(OPM) webserver was used to orient the system with respect
to a membrane plane and the CHARMM-GUI was used to generate
a PDB file of the system in either a
1-palmitoyl-2-oleoyl-sn-glycero-3-phosphocholine (POPC) and cholesterol
bilayer or a 3:1 POPC:POPE and cholesterol bilayer. Approximate
dimensions for the system were
105 × 105 × 110 Å for a total of 240 lipid and 7 cholesterol molecules. This
bilayer was then solvated in TIP3P water with 150 mM sodium chloride
ions balanced to achieve charge neutrality. The salt concentration of
150 mM is consistent with the conditions at which the receptor was
purified. POPE was used for the lipid in the transmembrane binding
sites of GABA_B1b and GABA_B2.

The PDB file for the full solvated system was prepared in VMD
(version 1.9.3) for simulation in NAMD (version 2.13) to produce a
protein structure file (psf). The OPLS-AA/M force field was used for the
protein, and OPLS-AA was used for the lipids, cholesterol and ions.
Disulfide bonds were placed between C546 and C644 in GABAB1b, and
C553 and C648 in GABAB2, and both the N and C termini were blocked
with capping groups: acetylated N termini and N-methylamide C
termini. NAMD was used to run molecular dynamics simulations, in which
all phases used periodic boundary conditions with nonbonded
interactions smoothed starting at 10 Å to 12 Å, with long-range interactions
treated with the particle mesh Ewald method. Systems were minimized
for 2,000 steps and then slowly heated in the NPT ensemble with a
Langevin thermostat and a Nosé-Hoover Langevin piston barostat set at 1
atm with a period of 50 fs and a decay of 25 fs. A 2-fs time-step was used
with the SHAKE and SETTLE algorithms. Heating occurred from 0 to
310 K in increments of 20 K with 0.4 ns of simulation at each increment.
Harmonic restraints of 1 kcal per mol per Å² were used during heating
on all non-hydrogen atoms of the protein and lipids. The system was
then equilibrated with 1 kcal per mol per Å² harmonic restraints on all
protein and lipid non-hydrogen atoms for 10 ns followed by another
10 ns of equilibration with 1 kcal per mol per Å² harmonic restraints on
non-hydrogen backbone atoms. Finally, 1 kcal per mol per Å² harmonic
restraints were applied to only Cα atoms for 2 ns before being stepped
down to 0.5 kcal per mol per Å² for 2 ns, 0.3 kcal per mol per Å² for 2 ns,
and then removed. The first 30 ns of unrestrained molecular dynamics
were also discarded as equilibration.
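
The staged heating and restraint-release protocol can be tabulated with a short sketch. This is not the NAMD configuration used in the study. The function and variable names are hypothetical, and it is an assumption that the final heating stage is clipped to 310 K (20, 40, ..., 300, 310 K), because 310 is not a multiple of 20.

```python
# Illustrative tabulation of the staged protocol; names are hypothetical.

TIMESTEP_FS = 2  # 2-fs time-step stated in the text


def ns_to_steps(ns, timestep_fs=TIMESTEP_FS):
    """Convert a duration in nanoseconds to a whole number of MD steps."""
    return round(ns * 1_000_000 / timestep_fs)


def heating_temperatures(target_k=310, increment_k=20):
    """Temperatures visited during heating.

    Assumption: the last stage is clipped to the target temperature.
    """
    temps = list(range(increment_k, target_k, increment_k))
    temps.append(target_k)
    return temps


# (restrained atoms, force constant in kcal/mol/A^2, duration in ns)
RESTRAINT_STAGES = [
    ("protein and lipid non-hydrogen atoms", 1.0, 10),
    ("non-hydrogen backbone atoms", 1.0, 10),
    ("C-alpha atoms", 1.0, 2),
    ("C-alpha atoms", 0.5, 2),
    ("C-alpha atoms", 0.3, 2),
]

heating = heating_temperatures()
heating_ns = 0.4 * len(heating)          # 0.4 ns per temperature increment
restrained_ns = sum(ns for _, _, ns in RESTRAINT_STAGES)

print(f"{len(heating)} heating stages of {ns_to_steps(0.4)} steps each")
print(f"restrained equilibration: {restrained_ns} ns")
print(f"discarded unrestrained run: {ns_to_steps(30)} steps")
```

Under these assumptions the restrained equilibration totals 26 ns, before the 30 ns of unrestrained dynamics that is also discarded.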

All trajectories were downsampled by 10× for analysis. Cavity volume
was calculated with Epock (1.0.5) in VMD on trajectories that had
been aligned to either GABAB1b or GABAB2 from the starting structure. The
cavity region was defined to include the binding region of the
hydrophobic tails of the lipid. Transmembrane-transmembrane distances
were calculated in VMD based on the Cα position of residues 3.33, 4.50,
5.40, 6.54 and 7.28 in the Ballesteros-Weinstein numbering scheme.
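
The downsampling and distance measurement can be illustrated with a plain-Python sketch. The analysis in the study was done in VMD; the code below is only a stand-in, the helper names are hypothetical, and the coordinates are made-up toy data rather than trajectory output.

```python
import math


def downsample(frames, stride=10):
    """Keep every `stride`-th frame (10x downsampling by default)."""
    return frames[::stride]


def distance_series(coords_a, coords_b):
    """Per-frame Euclidean distance between two atoms.

    Each argument is a list of (x, y, z) tuples in angstroms, one per
    frame, for example the C-alpha atoms of two transmembrane residues.
    """
    return [math.dist(a, b) for a, b in zip(coords_a, coords_b)]


# Toy data: atom A is fixed at the origin, atom B drifts along x.
atom_a = [(0.0, 0.0, 0.0) for _ in range(100)]
atom_b = [(10.0 + 0.01 * i, 0.0, 0.0) for i in range(100)]

distances = distance_series(downsample(atom_a), downsample(atom_b))
print(len(distances), "frames kept after downsampling")
```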


Transfection and seeding of cells for signalling assays 
HEK293 cells (ATCC CRL-1573) were transfected with expression vector
DNAs encoding the two GABAB receptor protomers and a chimeric
Gαq/o5 subunit (five C-terminal amino acids of Gαq were exchanged with
those of Gαo) to allow the Gαi/o-coupled GABAB receptor to activate PLC
and induce IP1 and intracellular Ca2+ release. Before transfection, cells
were brought into suspension by trypsinization and resuspension to
0.18 million cells per ml in growth medium (D-MEM, Gibco 10566016;
supplemented with 10% fetal bovine serum, Gibco 10270106; 1% sodium
pyruvate, Gibco 11360039; 1% MEM non-essential amino acids, Gibco
11140068; and 1% penicillin-streptomycin solution, Gibco 15140122).
For each 1 ml of cell suspension transfected, a total of 1 µg DNA in
25 µl OptiMEM (Gibco 51985) was incubated for 20 min with a mixture of
57 µl OptiMEM and 3 µl FuGene6 (Promega E2692). After FuGene6-DNA
complex formation, the mixture was added directly to the cell
suspension, mixed thoroughly and cells seeded with 100 µl cell suspension in
appropriate 96-well plates. Of the 1 µg DNA per ml cell suspension, the
amount of expression vector DNA encoding the chimeric Gαq/o5 was
0.5 µg/ml cell suspension in all experiments. The amount of
GABAB-encoding DNA was varied between 7.8 ng and 0.25 µg for each of the
GABAB-receptor protomer DNAs depending on the assay and mutants
tested. Empty vector DNA was added to give a total amount of 1 µg DNA
per ml cell suspension transfected. For characterization of basal activity
of the TM3 and TM5 protomer mutants, a typical gene dose experiment
was performed. DNA corresponding to 62.5 ng DNA per ml cell
suspension of each of the protomers (wild type or mutants) were mixed and
serially diluted twofold 5 times, typically down to 3.9 ng DNA per ml cell
suspension. The transfected cell suspension was seeded at 100 µg/ml
both in clear poly-L-lysine coated 96-well plates for IP1 accumulation
assays and in white poly-L-lysine coated 96-well plates for cell-surface
enzyme-linked immunosorbent assay (ELISA) assays.
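
The DNA bookkeeping in this protocol is simple arithmetic, sketched below with hypothetical helper names. One reading is assumed: "serially diluted twofold 5 times" is taken as a series of five concentrations (four halvings), because that reproduces the stated 3.9 ng endpoint, whereas five halvings of 62.5 ng would give about 2.0 ng.

```python
TOTAL_DNA_NG = 1000.0        # 1 ug total DNA per ml of cell suspension
CHIMERIC_G_ALPHA_NG = 500.0  # 0.5 ug chimeric G-alpha DNA per ml


def empty_vector_ng(protomer_ng):
    """Empty-vector DNA (ng per ml) needed to reach the 1 ug total when
    each of the two protomer DNAs is added at `protomer_ng`."""
    filler = TOTAL_DNA_NG - CHIMERIC_G_ALPHA_NG - 2 * protomer_ng
    if filler < 0:
        raise ValueError("protomer DNA exceeds the 1 ug total")
    return filler


def twofold_series(start_ng, points=5):
    """Twofold dilution series containing `points` concentrations."""
    return [start_ng / 2 ** i for i in range(points)]


series = twofold_series(62.5)
print([round(x, 1) for x in series])
print(empty_vector_ng(250.0), empty_vector_ng(7.8))
```

At the upper bound of 0.25 µg per protomer no empty vector is required, which is consistent with the stated 1 µg total.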


IP1 accumulation assays

The IP1 assays for wild-type and mutant receptors were performed
essentially as previously described. Forty-eight hours after
transfection, the growth medium was replaced with HBSS buffer (HBSS (Gibco
14025), 20 mM HEPES pH 7.5, 1 mM CaCl2, 1 mM MgCl2 and 0.1% BSA)
supplemented with BSA to 0.5% and incubated at 37 °C for 3-4 h.

For characterization of the TM3-TM5 protomer-interface mutants
for basal activity, the HBSS + 0.5% BSA buffer was replaced with 100 µl
HBSS buffer, followed by addition of 50 µl HBSS buffer containing LiCl
(150 mM) to give a final concentration of 50 mM LiCl. After incubation
for 1 h at 37 °C the IP1 accumulation was stopped by addition of 40 µl
CisBio IP-One Tb HTRF Kit (CisBio, 62IPAPEC) lysis buffer. The
accumulated IP1 levels were determined according to the manufacturer’s
instructions and as previously described.

For the generation of GABA concentration-response curves, the
compounds were diluted to three times the final concentration in HBSS
buffer containing 60 mM LiCl. The assay was started, first by
replacing the HBSS + 0.5% BSA buffer with 100 µl HBSS buffer, followed by
addition of 50 µl of the above compound dilutions to give a final LiCl
concentration of 20 mM. The IP1 accumulation assay was stopped and
assayed as described above after incubation for 1 h at 37 °C. Data were
calculated as the amount of IP1 formed per well or normalized to the
basal IP1 level, and fitted by nonlinear regression using GraphPad Prism.
Results and description of statistical analyses used in this Article can
be found in Supplementary Table 1.
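
The LiCl concentrations quoted above follow from a simple dilution: 50 µl of stock added to 100 µl of buffer is a threefold dilution, which is also why compounds are prepared at three times their final concentration. A minimal check, with a hypothetical function name:

```python
def final_concentration(stock_mm, added_ul, existing_ul):
    """Concentration (mM) after adding `added_ul` of stock at `stock_mm`
    to `existing_ul` of buffer that contains none of the solute."""
    return stock_mm * added_ul / (added_ul + existing_ul)


basal_licl = final_concentration(150, 50, 100)   # basal-activity assay
curve_licl = final_concentration(60, 50, 100)    # concentration-response assay
dilution_factor = (50 + 100) / 50

print(basal_licl, curve_licl, dilution_factor)
```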


Cell-surface ELISA assay 

Surface expression levels of wild-type and mutant GABAB receptors
were determined using a direct ELISA against the N-terminal GABAB1b
Flag tag and the N-terminal GABAB2 HA tag, as previously described.
Transfected cells were seeded in white poly-D-lysine-coated 96-well
plates. Forty-eight hours after transfection, cells were washed once with
100 µl per well DPBS + 1 mM CaCl2 (wash buffer). Following fixation with
50 µl per well 4% paraformaldehyde solution for 5 min at room
temperature, cells were washed twice with 100 µl wash buffer and blocked
with 100 µl per well blocking solution (3% dry milk, 1 mM CaCl2, 50 mM
Tris-HCl, pH 7.5) for 30 min at room temperature, followed by addition
of 75 µl per well HRP-conjugated anti-Flag antibody (Sigma Aldrich,
A8592), or HRP-conjugated anti-HA antibody (R&D Systems, HAM0601),
both diluted 1:2,000 in blocking solution, and allowed to incubate for
1 h at room temperature. The plates were then washed four times with
100 µl per well blocking solution followed by four washes with wash
buffer. The amount of surface-expressed receptors was detected by
adding 60 µl wash buffer and 20 µl HRP substrate (Bio-Rad, 170-5060)
per well, incubating for 10 min and measuring luminescence in an
EnVision plate reader (Perkin Elmer).


Statistics and reproducibility 

Detailed results and description of statistical analyses used in this
Article can be found in Supplementary Table 1. Data and error bars in
Figs. 1c, d are mean ± s.e.m. from at least five independent experiments.
In Fig. 1c, data of co-expressed wild-type GABAB1b (6.3 ng) with wild-type
GABAB2 (6.3 ng), wild-type GABAB1b (3.1 ng) with wild-type GABAB2
(3.1 ng), and wild-type GABAB1b (1.6 ng) with wild-type GABAB2 (1.6 ng)
represent n = 7 independent experiments; data of co-expressed wild-type
GABAB1b (0.8 ng) with wild-type GABAB2 (0.8 ng), solely expressed
wild-type GABA,, (6.3 ng), solely expressed wild-type GABA,, (3.1ng), 


and untransfected HEK293 cells represent n = 6 independent
experiments. In Fig. 1d, data of co-expressed GABAB1b(Δ627-634) with wild-type
GABAB2, and wild-type GABAB1b with GABAB2(Δ631-638) represent n = 7
independent experiments; and co-expressed GABAB1b(Δ627-634)
with GABAB2(Δ631-638) represent n = 5 independent experiments.
Data in Fig. 2d are representative of one experiment performed in
triplicate and repeated independently at least three times with
similar results (n = 3); data points and error bars are mean ± s.e.m. Data
in Fig. 2e are mean ± s.e.m. from at least three independent
experiments; data for co-expressed GABAB1b(H572A/E673A) with wild-type
GABAB2, and wild-type GABAB1b with GABAB2(H579A/E677A) represent
n = 3 independent experiments; and data for wild-type GABAB1b with
wild-type GABAB2 represent n = 7 independent experiments. Data in
Fig. 3d are mean ± s.e.m. from at least four independent experiments;
co-expressed GABAB1b(L553W) with GABAB2(L560W) represent n = 5
independent experiments, and GABAB1b(R549A) with GABAB2(R556A)
represent n = 4 independent experiments, and data for wild-type GABAB1b with
wild-type GABAB2 represent n = 7 independent experiments. Ensemble
data in Fig. 3e represent distribution over a 200-ns time course, for 5
simulations per condition; each violin represents n = 37,500 time points.
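
For readers unfamiliar with the summary statistic, mean ± s.e.m. over n independent experiments can be computed as below. The authors used GraphPad Prism; this sketch only shows the definition (sample standard deviation divided by the square root of n), and the input values are hypothetical, not data from the study.

```python
import math
import statistics


def mean_sem(values):
    """Return (mean, standard error of the mean) for n >= 2 values."""
    n = len(values)
    if n < 2:
        raise ValueError("s.e.m. needs at least two independent values")
    mean = statistics.mean(values)
    sem = statistics.stdev(values) / math.sqrt(n)  # sample s.d. / sqrt(n)
    return mean, sem


# Hypothetical per-experiment values, one per independent experiment.
mean, sem = mean_sem([1.0, 2.0, 3.0])
print(f"{mean:.2f} ± {sem:.2f} (n = 3)")

# The ensemble size quoted for Fig. 3e implies 7,500 time points per
# simulation when split evenly over 5 simulations.
points_per_simulation = 37_500 // 5
```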


External computational resource use 

The Extreme Science and Engineering Discovery Environment (XSEDE)
resource comet-gpu through sdsc-comet allocation TG-MCB190153 was
used in preparation of this Article.


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 
Extended Data Fig. 1 | Sample preparation, cryo-EM processing and
reconstruction of GABAB heterodimer. a, Purification scheme for GABAB.
b, c, Representative cryo-EM micrograph of 10,315 collected (b) and 2D class
averages (c) of GABAB dimers. d, Flow chart outlining the cryo-EM processing
workflow using RELION; the global resolutions of the full-length structure
and VFT focused structures were 3.6 Å and 3.5 Å, respectively, at 0.143 Fourier
shell correlation (FSC) as calculated by RELION. e, f, Gold-standard FSC curve of
half-maps calculated using RELION and Phenix Mtriage (e), and map-to-model
validation curves generated through Phenix Mtriage (f). g, Local resolution of
cryo-EM maps. h, Angular distribution of projections used in final cryo-EM
reconstruction. i, Ordered cryo-EM densities (light yellow), probably
corresponding to GDN and/or cholesteryl hemisuccinate, are found at the TM5
interface of GABAB1b and GABAB2.
Extended Data Fig. 2 | Agreement between cryo-EM map and model.
a, Electron microscopy density and model for GABAB heterodimer complex;
transmembrane helices of GABAB1b, transmembrane helices of GABAB2, linker
region, bound phosphatidylethanolamine (PE) and ligand CGP55845. Densities
visualized within UCSF Chimera and zoned at 2.2 with threshold set to 0.0142,
with the exception of the following: GABA,,-bound lipid, GABAB1b-bound
CGP55845 and GABA,, linker, in which thresholds of 0.01, 0.0189 and 0.0127
were used, respectively. b, Electron microscopy density and model for GABAB1b
homodimer; transmembrane helices and linker region of both protomers,
7TM-bound PE and ligand CGP55845. Densities were zoned at 2.2 and threshold
set to 0.016, apart from CGP55845 and the ECL2 in both protomers, in which a
threshold of 0.03 or 0.02 was used, respectively.
Extended Data Fig. 3 | Binding of CGP55845 and cation to GABAB1b VFT.
a, Model of CGP55845 within the VFT of GABAB1b. The entire ligand is confined
by W65 and W278 of GABAB1b, which form hydrophobic interactions with the
chlorinated ring of CGP55845. b, Schematic of interacting residues on GABAB1b
with the inhibitor, CGP55845. GABAB1b residues S153 and S130 form hydrogen
bonds with oxygen atoms of the phosphate group, and H170 and E349
form a hydrogen bond and a salt bridge with the amine group of the ligand,
respectively. π-π stacking occurs between the chlorinated ring structure of
CGP55845 and W278, and W65 provides hydrophobic packing on the opposing
side of the ring. S130, H170, E349 and W65 are all substantially different
residues in GABAB2, precluding ligand binding. Residues are colour-coded
corresponding to their properties: light blue, hydrophilic; orange, anionic;
green, hydrophobic; yellow, glycine; and grey, cysteine. Interaction lines are
also colour-coded according to their type: light blue, side-chain hydrogen
bonding; grey, backbone hydrogen bonding; blue-red gradient, salt bridge;
and green, π-π stacking. c, Overlay of CGP55845-bound GABAB cryo-EM
structure (tan and teal) with CGP54626-bound GABAB crystal structure (pink)
(PDB 4MR7) or apo-state GABAB crystal structure (grey) (PDB 4MQE), resulting
in r.m.s.d. values of 1.30 Å and 1.28 Å, respectively. d, Comparison of ligand
pose between CGP55845 (yellow) and CGP54626 (blue), which differs from
CGP55845 only in a substitution of an aromatic ring in place of cyclohexane.
e, Spherical density surrounded by anionic residues within the VFT supports a
cation (magenta) at that site. The presence of a metal ion would be consistent
with the observation that calcium and other divalent ions affect ligand
affinity, and examination of the deposited scattering factors for the
high-resolution VFT crystal structure (PDB 4MR7) reveals positive difference
density also consistent with a cation at this site.
Extended Data Fig. 4 | Comparison of structures across GPCR classes.
a, GABAB protomers share similar secondary and overall structure.
b, Comparison of mGlu5 and GABAB 7TM and ECL2-linker shown from side
view. c, Top-down view of GABAB1b, GABAB2, mGlu5 (PDB 6N52) and family A M2
acetylcholine receptor (PDB 3UON) with sidechains corresponding to the
toggle-switch motif shown. Phospholipid space-filling model is included in
grey within GABAB1b and GABAB2. d, Sequence alignment of human GABAB
receptors with mGlu5 and M2 receptors, comparing canonical GPCR activation
motifs: TM3-TM6 ionic lock (blue), toggle-switch motif (green) and FXPKXY
motif (yellow). Sequences are aligned to motifs within each transmembrane
helix and transmembrane helical secondary structure is underlined. Residues
in GABAB sequences differing from canonical motifs are outlined in pink. The
cysteine residue that replaces the toggle-switch tryptophan is highlighted in
pink.
Extended Data Fig. 5 | Atomistic simulations of phospholipid structural
stabilization and entry. a, The results of three out of seven total simulations of
GABAB 7TM and linker in the absence of core-bound lipid after 200 ns. The
results show the extent of lipid (green and purple) entry in GABA,, (grey) in the
simulations versus the experimental structure (tan) with PE (orange). One of
the trajectories was extended an additional 100 ns (rightmost panel) and lipid
entry was observed to progress towards the core. b, Representative side view
of the GABAB ribbon and stick model from simulations at 200 ns, showing
persistence of the ECL2 and β-sheet structure even in absence of the VFTs.
c, d, Violin plots of ensemble distances between GABAB1b (c) and GABAB2 (d) TM
helices in simulations with and without core-bound lipid. Distances were
measured from the Cα atoms of the following residues in GABAB1b: L550
(TM3), W611 (TM4), L653 (TM5), M707 (TM6) and A716 (TM7); and the following
residues in GABAB2: T557 (TM3), W615 (TM4), L657 (TM5), F711 (TM6) and Q720
(TM7). Simulations were run over 200 ns and violin plots represent n = 37,500
data points and 5 simulations per condition. e, 7TM cavity volume
measured over time for individual simulations. Average cavity volume of
phospholipid-bound receptor is shown as dashed line, thick lines indicate
rolling averages of 5.33 ns and thin lines represent raw data.
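
The rolling average used to smooth the cavity-volume traces can be sketched as follows. The function names are hypothetical and the smoothing tool used by the authors is not stated. The frame spacing is inferred rather than given: 37,500 points over 5 simulations of 200 ns implies 7,500 frames per simulation, so a 5.33-ns window would span about 200 frames.

```python
def window_frames(window_ns, total_ns, n_frames):
    """Number of frames spanned by a time window, assuming evenly
    spaced frames over the whole trajectory."""
    return round(window_ns * n_frames / total_ns)


def rolling_mean(values, window):
    """Simple moving average; returns len(values) - window + 1 points."""
    if window < 1 or window > len(values):
        raise ValueError("window must be between 1 and len(values)")
    total = sum(values[:window])
    means = [total / window]
    for i in range(window, len(values)):
        total += values[i] - values[i - window]  # slide the window by one
        means.append(total / window)
    return means


frames_in_window = window_frames(5.33, 200, 7500)
smoothed = rolling_mean([1, 2, 3, 4, 5], 3)  # toy series, not real volumes
print(frames_in_window, smoothed)
```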


[Extended Data Fig. 6, panels a-h: plots of IP1 per well (nM) against normalized GABAB subunit surface expression for wild-type and mutant receptors; panel h gives the symbol key with transfected DNA amounts.]

Extended Data Fig. 6 | Functional analysis of GABAB mutants.
a, Comparative normalized surface-expression levels of constructs.
Surface-expression levels of wild-type and mutant GABAB receptors were
determined using a direct ELISA against the N-terminal GABAB1b Flag tag and the
N-terminal GABAB2 HA tag. Surface-expression levels were normalized to the
surface expression of a wild-type GABAB receptor when 6.3 ng Flag-tagged
GABAB1b and 6.3 ng HA-tagged GABAB2 DNA was used for transfection. The line
x = y indicates similar surface expression of GABAB subunits as evident from
transfection with lower amounts of Flag-tagged GABAB1b and HA-tagged GABAB2.


Values above the line have greater GABA,, surface expression relative to 
GABAyg,, and values below the line have greater GABA,, surface expression 
relative to GABA,,. GABA,, did not reach the cell surface (data point hidden 
behind triangle at the coordinate (0,0) in the left subpanel). b, c, Mutations of
ionic residues forming the interface of GABAB1b (b) and GABAB2 (c) result in
increased constitutive activity of the receptor. To achieve a range of expression
levels of the mutants in b and c, the DNA amounts transfected were serially
diluted twofold from 3.3 ng (mutants in b) or 6.3 ng (mutants in c) of each
subunit, respectively. d, e, GABAB2(H579A/E677A) expressed without GABAB1b
shows a moderate increase in basal activity over wild-type GABAB2 expressed
alone (d), although surface expression (e) of the wild-type GABAB2 subunit was
higher than of GABAB2(H579A/E677A). f, g, Mutation of lipid coordinating
residues (blue triangle) increases the constitutive activity of GABAB1b (f) and
GABAB2 (g), whereas mutations displacing the lipid tails from the 7TM core
(purple triangle) result in decreased basal activity of the receptor when
compared to wild-type receptor of similar receptor surface expression. h, Key
to colours and symbols used in a, f and g with DNA transfection amounts
indicated in parentheses. Data are representative of one experiment
performed in triplicate and repeated independently at least three times with
similar results (n = 3 independent experiments); error bars represent
mean ± s.e.m.


Extended Data Fig. 7 | Modelling of phospholipid into GABAB. a, Schematic
of GABAB1b residues interacting with the polar headgroup of PE. The terminal
amine (-NH3+) forms a salt bridge with residue D714 in ECL3, and the phosphate
group is coordinated by S710 of ECL3, R549 of TM3 and the backbone nitrogen
of C644 of ECL2. b-d, As our receptor purification did not contribute
additional lipid, we considered the known lipid composition of Sf9 insect
cells. Four primary phospholipids are present in Sf9 insect cells:
phosphatidylcholine (43%), phosphatidylethanolamine (32%), phosphatidylinositol (23%)
and cardiolipin (4%). It was apparent from the map that the lipid had only two
carbon chains, immediately excluding cardiolipin as it has four hydrocarbon
tails. A comparison of the map and the binding site residues led to the decision
to model PE (b) into the pocket using the GemSpot pipeline, which produced
good cross-correlation with favourable interactions. To further confirm our
selection, analysis of overlays of phosphatidylcholine (c) and
phosphatidylinositol (d) over the docked model revealed phosphatidylcholine is unlikely
given that the interactions with the cation appear to be primarily salt bridges,
rather than the cation-π interactions that more commonly coordinate
choline in proteins. Although phosphatidylinositol may make favourable
interactions, our map does not appear to support such a large moiety in the
headgroup position. Thus, PE is the most likely lipid to reside in the structure
and was therefore used in the models.
Extended Data Fig. 8 | Cryo-EM processing workflow of GABAB1b
homodimer. a, Flow chart outlining the cryo-EM processing of the GABAB1b
homodimer. b, c, Representative micrograph of 5,602 collected (b) and 2D
class averages (c). d, Local resolution of cryo-EM maps. e, Angular distribution
of projections used in the final cryo-EM reconstruction. f, g, Gold-standard FSC
curve of half-maps calculated using RELION and Phenix Mtriage (f), and
map-to-model validation curves generated through Phenix Mtriage (g). Global
indicated resolutions of the full-length structure and VFT structure were
3.2 Å and 3.1 Å, respectively, at 0.143 FSC as calculated by RELION. h, Electron
microscopy maps of active mGlu5 (purple) (EMD-0345), and homodimeric
GABAB1b (green). The GABAB1b homodimer adopts an overall architecture similar
to that of active mGlu5.


Extended Data Table 1| Cryo-EM data collection, refinement and validation statistics 


Data collection and processing
Magnification (×)
Voltage (kV)
Electron exposure (e−/Å²)
Defocus range (µm)
Pixel size (Å)
Symmetry imposed
Initial particle images (no.)
Final particle images (no.)
Map resolution (Å)

FSC threshold
Map resolution range (Å)
Map sharpening B factor (Å²)


Refinement

Initial model used (PDB code)
Model resolution (Å)

FSC threshold


Model composition
Non-hydrogen atoms
Protein residues
Ligands

B factors (Å²)
Protein
Ligand

R.m.s. deviations
Bond lengths (Å)
Bond angles (°)

Validation
MolProbity score
Clashscore
Poor rotamers (%)

Ramachandran plot
Favored (%)
Allowed (%)
Disallowed (%)
Statistics


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 
n/a | Confirmed 

| x The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 

[| xX A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


C x The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


XI} A description of all covariates tested 
xX [| A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


O x A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


O x For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


xX [| For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 
| For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


xX [| Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection CryoEM Data was collected automatically on a Titan Krios (FEI) using SerialEM. 


Data analysis The following software was used in this study: MotionCor2, Gctf, CTFFind-4.1, Relion3.0, Relion3.1, UCSF Chimera 1.14, Coot (0.8.9.1el),
Phenix (1.17.1-3660), GemSpot, Maestro (2019-4), OPM, CHARMM-GUI, VMD (1.9.3), NAMD (2.13), Epock (1.0.5), GraphPad Prism 8. 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability 


All data generated or analyzed during this study are included in this article and the Supplementary Information. CryoEM maps of GABAB heterodimer and GABAB1b 
homodimer have been deposited in the Electron Microscopy Data Bank under accession codes EMD-21533 and EMD-21534, respectively. The atomic coordinates of 
GABAB heterodimer and GABAB1b homodimer have been deposited in the Protein Data Bank under the accession codes 6W2X and 6W2Y, respectively. 
Life sciences study design
the microscope.
Data exclusions No data was systematically excluded. Generation of maps from cryo-EM particles involves use of Relion (3.0 or 3.1) to sort particles, and
remove damaged or poor quality particles to achieve a high-resolution final reconstruction. 


Replication Cellular signaling assays were replicated in at least three independent experiments for each condition tested. All replication attempts were 
successful. 


Randomization No randomization was attempted or needed. 


Blinding No blinding was attempted or needed. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 
n/a | Involved in the study n/a | Involved in the study 
| Antibodies | ChIP-seq 
| Eukaryotic cell lines [| Flow cytometry 
|__}| Palaeontology | MRI-based neuroimaging 


Animals and other organisms 


Human research participants 


Clinical data 
Antibodies 
Antibodies used HRP-conjugated anti-FLAG antibody (Sigma Aldrich, A8592) and HRP-conjugated anti-HA antibody (R&D Systems, HAM0601).
Validation Antibodies are validated by supplier. No additional validation was conducted by authors of this study. 


Eukaryotic cell lines 


Policy information about cell lines 


Cell line source(s) HEK293 cells (ATCC® CRL-1573™) 
Sf9, Expression Systems, Cat 94-001S 


Authentication Cell lines are maintained by the supplier. No additional authentication was performed by the authors of this study. 
Mycoplasma contamination Cell lines are tested by manufacturer for contamination and no additional testing was performed by the authors of this study. 


Commonly misidentified lines None used. 
(See ICLAC register) 
Author Correction:
Structural insights into
µ-Opioid receptor
activation


https://doi.org/10.1038/s41586-020-2542-z


Correction to: Nature https://doi.org/10.1038/nature14886


Published online 5 August 2015


Weijiao Huang, Aashish Manglik, A. J. Venkatakrishnan,
Toon Laeremans, Evan N. Feinberg, Adrian L. Sanborn,
Hideaki E. Kato, Kathryn E. Livingston, Thor S. Thorsen,
Ralf C. Kling, Sébastien Granier, Peter Gmeiner,
Stephen M. Husbands, John R. Traynor, William I. Weis,
Jan Steyaert, Ron O. Dror & Brian K. Kobilka


In this Article, the configuration of BU72 at the benzylic amine was
mistakenly reported as (S). Re-fitting this stereocentre to (R) improved the
fit of the observed density in the X-ray structure. The (S)-configuration
was originally assigned on the basis of the expected stereochemistry
from ref. 1. The origin of the inversion is unclear, but we propose that the
original assignment, based only on nuclear Overhauser effect
experiments, may well have been incorrect. We thank Thomas Munro for
bringing this error to our attention. The Protein Data Bank (PDB) entry
of the active µ-opioid receptor bound to the BU72 agonist has been
updated as 5C1M. The original Article has not been corrected online.


1. Husbands, S. M. & Lewis, J. W. Morphinan cyclic imines and pyrrolidines containing a
constrained phenyl group: high affinity opioid agonists. Bioorg. Med. Chem. Lett. 5,
2969-2974 (1995).
Author Correction:
Reversing a model of
Parkinson’s disease with
in situ converted nigral
neurons


Correction to: Nature https://doi.org/10.1038/s41586-020-2388-4


Published online 24 June 2020


Hao Qian, Xinjiang Kang, Jing Hu, Dongyang Zhang, Zhengyu Liang,
Fan Meng, Xuan Zhang, Yuanchao Xue, Roy Maimon, Steven F. Dowdy,
Neal K. Devaraj, Zhuan Zhou, William C. Mobley, Don W. Cleveland &
Xiang-Dong Fu


In Fig. 5e of this Article, the labels ‘Empty’ and ‘shPTB’ were
inadvertently swapped. In addition, ‘Peking-Tsinghua Center for Life Sciences’
should be added to affiliation 2 (associated with authors Xinjiang Kang
& Zhuan Zhou) such that the full affiliation reads: ‘State Key Laboratory
of Membrane Biology and Peking-Tsinghua Center for Life Sciences,
Institute of Molecular Medicine, Peking University, Beijing, China’. The
original Article has been corrected online.
Corrections & amendments


Author Correction: The
evolutionary history of
lethal metastatic
prostate cancer


https://doi.org/10.1038/s41586-020-2581-5


Correction to: Nature https://doi.org/10.1038/nature14347


Published online 1 April 2015


Gunes Gundem, Peter Van Loo, Barbara Kremeyer,
Ludmil B. Alexandrov, Jose M. C. Tubio, Elli Papaemmanuil,
Daniel S. Brewer, Heini M. L. Kallio, Gunilla Hagnas, Matti Annala,
Kati Kivinummi, Victoria Goody, Calli Latimer, Sarah O'Meara,
Kevin J. Dawson, William Isaacs, Michael R. Emmert-Buck,
Matti Nykter, Christopher Foster, Zsofia Kote-Jarai, Douglas Easton,
Hayley C. Whitaker, ICGC Prostate UK Group*, David E. Neal,
Colin S. Cooper, Rosalind A. Eeles, Tapio Visakorpi, Peter J. Campbell,
Ultan McDermott, David C. Wedge & G. Steven Bova


In the Supplementary Information of this Article, some errors were 
made in compiling the Supplementary Data file called ‘Supplementary 
Variant Lists’. In the first worksheet (entitled ‘subs’) the clusters to 
which some single nucleotide variants (SNVs) were assigned were not 
reported and were replaced with ‘NA’ for ‘not available’. The correct 
cluster numbers have now been inserted. Three mutations occurring 
in tumour A32 were incorrectly assigned to cluster 3 and have now been
correctly assigned to cluster 2. Duplicate entries were present for some 
mutations in A32. These duplicate entries have now been removed. In 
the fourth worksheet (entitled ‘copy_number’) incorrect copy number 
information was reported for sample A31-c. The previous information, 
implying that A31-c was primarily diploid, has now been replaced with 
the copy number call used within the Article, which indicates that A31-c 
is primarily tetraploid. The Supplementary Information to this Amend- 
ment contains the corrected Excel file ‘Supplementary Variant Lists’. 
These corrections do not change the authors’ interpretation of the 
findings. The original Article has not been corrected. 


Supplementary information is available in the online version of this Amendment. 


Correspondence should be addressed to D.C.W. 


*A list of participants of the ICGC Prostate UK Group and their affiliations appears in section 5 of the Supplementary Information of the original Article. 
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Author Correction: 
The dental proteome of 
Homo antecessor


https://doi.org/10.1038/s41586-020-2580-6 


Correction to: Nature https://doi.org/10.1038/s41586-020-2153-8 


Published online 01 April 2020 




Frido Welker, Jazmin Ramos-Madrigal, Petra Gutenbrunner, 
Meaghan Mackie, Shivani Tiwary, Rosa Rakownikow 
Jersie-Christensen, Cristina Chiva, Marc R. Dickinson, 

Martin Kuhlwilm, Marc de Manuel, Pere Gelabert, 

Maria Martinon-Torres, Ann Margvelashvili, Juan Luis Arsuaga, 
Eudald Carbonell, Tomas Marques-Bonet, Kirsty Penkman, 
Eduard Sabido, Jürgen Cox, Jesper V. Olsen, David Lordkipanidze,
Fernando Racimo, Carles Lalueza-Fox, José Maria Bermudez 

de Castro, Eske Willerslev & Enrico Cappellini 


In this Article, the sentence beginning ‘However, pairwise divergence...’
erroneously cited Fig. 3b instead of Fig. 2b. The full sentence should 
therefore read: ‘However, pairwise divergence of the amino acid 
sequences between H. antecessor and the clade containing H. sapiens, 
Neanderthals and the Denisovan is larger than the divergence between 
the members of this clade (Fig. 2b, Supplementary Fig. 12, Supplementary
Information).’ The Article has been corrected online.
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Success in a job application depends on intangible factors as well as publication metrics. 


LOOK BEYOND 


METRICS 


Survey explores how and why certain people get 
job offers. By Nina Notman and Chris Woolston 


It takes at least 15 job applications to land
a single offer, finds a survey of 317 early-career
researchers who applied for faculty positions
in a range of nations (J. D. Fernandes et al. eLife 9, e54097;
2020). The results have shed light on a hiring 
process that is often opaque, frustrating and 
hard to predict. Contrary to common belief, 
the authors found that a publication in a
high-profile journal isn’t an absolute prereq- 
uisite for a successful application. 

The survey was conducted by members of 
the Future PI Slack community, a postdoctoral 
support group. They collected responses 
from researchers who had applied for fac- 
ulty positions between May 2018 and May 
2019. Respondents hailed from 13 countries, 
although 72% were from the United States; 85% 
were in the life sciences. Overall, 58% received 
job offers, significantly above the average 
from other studies, suggesting that successful 
applicants were especially willing to take the
survey. Only 26% had an authorship credit in
Cell, Nature or Science. 

The survey tracked conventional metrics 
of success such as fellowships, citations and 
publications, and found that these measures 
were only modestly effective at predicting 
which applicants would get job offers. The 
authors tried to construct a flow chart to pre- 
dict the applicants’ fate, but it was less than 
60% accurate. 


More is more 

One clear lesson was that jobseekers shouldn't 
skimp on applications. Those who submitted 
more than 15 applications landed substantially 
more on-site interviews than those who didn't. 
They also received more job offers, but the cor- 
relation wasn’t so strong, suggesting that the 
shotgun approach to applications can go only 
so far. Jobseekers do better when they send 
applications that are precisely tailored to the 
position sought, says survey co-author Chris 
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Smith, manager of the postdoc programme 
at North Carolina State University in Raleigh. 

Publication in a high-profile journal helped 
applicants, but it was no guarantee of success, 
the study found. Jobseekers who were first 
authors of a paper in Cell, Nature or Science 
received offers on 11% of their applications. For 
people who had no papers in those journals, 
the success rate per application was 2%. 

The authors also polled 15 faculty mem- 
bers who had served on hiring committees 
to find out what makes an application suc- 
cessful. The results suggest that committees 
seek candidates who will be good colleagues 
and scientists, says the study’s lead author, 
Amanda Haage, a biomedical scientist at the 
University of North Dakota in Grand Forks. The 
committees looked at affability as well as at 
track records, she says. 


Intangible assets 


Recruitment of academics is more holistic 
than many think, explains Robert Bowles, a 
careers adviser at the Royal Society of Chem- 
istry in Cambridge, UK. Applicants “tend to 
forget about the intangibles”, he says. “Pub- 
lications and grants buy you a ticket into the 
lottery, but they are not going to win it for you.”

When asked for their general thoughts on 
the application process, jobseekers had a 
nearly universal negative outlook. Most were 
frustrated that they did not receive feedback 
on their applications. 

But faculty members on hiring committees 
find it impossible to provide detailed feedback
on all applications. All 15 surveyed said that 
they typically receive more than 100 appli- 
cations per job offer; 10 said they usually get 
more than 200. 

It’s just one more sign of a stark imbalance 
in academia. Jobseekers outnumber positions, 
and that disparity is expected to get worse in 
the wake of the current coronavirus pandemic. 

In this environment, postdocs should keep
in mind that unsuccessful applications do 
not mean failure, says biochemist Rosemary 
Bass, a careers adviser at the University of 
East Anglia in Norwich, UK. “We see people 
who think that making five applications and 
not getting an interview means that they are 
a reject,” Bass says. “The current situation is 
bringing home the difficulties of the academic 
job market to PhD students and postdocs in 
a way that we as careers advisers have not 
managed to previously.” 


Nina Notman is a freelance writer in Salisbury,
UK. Chris Woolston is a freelance writer in
Billings, Montana.
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The COVID-19 pandemic put paid to a planned gathering in County Tipperary, Ireland. 


TEAM BONDING IN 
A VIRTUAL STYLE


One laboratory made the most of lockdown by 
moving a retreat online. By Rob Salguero-Gomez 


On 5 April, two weeks into the UK
lockdown, my laboratory group and 
I were meant to have flown to Ireland. 
It would have been our first research 
retreat since I became a Department 
of Zoology faculty member at the University of 
Oxford, UK. The goal of the retreat, which was 
to have been held in a manor house in County
Tipperary, was to enhance team dynamics and 
advance research projects. 
I wanted the retreat to go ahead, so I
organized a virtual version. Here are some 
steps our team took to make it a success. 


Pitch the idea. Once it became clear that the 
retreat wouldn't be possible as we had planned, 
I organized an hour-long video conference
with my group to discuss our options: can- 
cel, postpone or do something different. The 
democratic vote was for a virtual version of the 
retreat, with the in situ one to be postponed 
until it was safe to hold it. 


Plan together. At that online meeting, I asked 
my team what format it would prefer for the 
virtual retreat. We agreed to start each day at 
9 a.m. with a statement of everybody's goals for
the day, and to continue until 5 p.m., with two 
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30-minute breaks for morning and afternoon 
coffee, and a one-hour break for lunch. These 
breaks gave members the opportunity to chat 
informally through a video call. 

We also organized a series of 30-minute 
skills workshops after lunch, run by volunteers 
from the lab, on academic writing, research 
presentations and developing grant pro- 
posals. It soon became clear to me that the 
schedule needed to be flexible because one 
of my postdocs much prefers working in the 
evening, and two PhD students are currently 
based in Brazil and California. We agreed that 
they could join in when they woke up, and then
carry on with writing activities during their 
normal working hours. 


Set one daily task. To help lab members be 
more productive during the retreat, I asked 
them in advance to choose a single project to 
focus on each day. Projects ranged from doing 
analyses to writing manuscripts, essays or 
grant proposals. 


Screen out e-mails and other online 
distractions. A week before the virtual retreat, 
I encouraged my students and postdocs to 
identify ‘urgent’ e-mails to be dealt with in 
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advance; all else would have to wait until after 
the retreat. As a lab, we turned on our ‘out of 
office’ automatic reply and agreed to not 
check e-mails during the retreat’s writing slots. 


Build in relaxation. In the absence of the 
Irish manor house and its surroundings, 
we organized social activities from 5 p.m. 
onwards, including virtual drinks and a 
pub quiz. We also found that having a time 
for a walk in each person’s back garden, to 
disconnect from the screen, was invaluable. 


Delegate tasks. Once I had taken care of the 
basic structure of the retreat and run the first 
session, I asked volunteers to help keep time
and moderate subsequent events. This gave 
team members a chance to exercise these key 
skills. However, it was also especially important 
for me: a couple of weeks before the retreat, my
wife and I started displaying the symptoms of 
coronavirus. We were not deemed ill enough 
by our doctor to seek hospital treatment. 
Instead, we managed our symptoms from 
home with medication and rest. 


Provide feedback individually. I contacted
each team member one by one, to ask how 
they were doing and to provide feedback on 
their projects. This ensured that they did not 
get stuck while I was focused on recuperation. 


Organize a final wrap-up. We used the last 
hour of the whole retreat to discuss each lab 
member's progress and trials and tribulations, 
including my own. Creating a space for every- 
body to share their struggles regarding lack 
of motivation to write, as well as tips to do so 
efficiently, was greatly appreciated across 
the team. Each of us also explained our future 
steps, as well as setting out a rough timeline. 


Keep track of post-retreat progress. I still
hold a virtual weekly meeting with my lab 
members individually, for us to discuss how 
retreat-related projects are going. 

In the past, I have attended writing retreats 
at which members were assigned a partner 
to check on their progress every two weeks. 
Sadly, this strategy didn’t occur to me in time 
to implement it during our retreat. I also
organized a poll to seek feedback on the event. 


Staging the retreat virtually helped me 
navigate a fresh challenge while, I hope,
providing my team with the support it needs 
in the middle of a pandemic. Now, most weeks,
during our for-now-virtual lab meeting, I try to 
encourage the team with the promise that our 
retreat in Ireland will go ahead one day. We’re 
looking forward to it. 


Rob Salguero-Gomez is an associate 
professor in ecology at the Department of 
Zoology, University of Oxford, UK. 
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I had sea worms on my mind during a
March expedition through the Ningaloo 
Canyons off the northwest coast of 
Australia. Cruising aboard the RV Falkor, 
a research ship operated by the Schmidt
Ocean Institute in Palo Alto, California, I was 
hoping to collect and sequence new deep- 
sea worm species. Everyone on board knew 
they were in for surprises, but nobody could 
have imagined the creature that suddenly 
appeared on our screen one day. 

Here, in a moment of pure excitement,
I’m standing with Nerida Wilson, a marine 
biologist at the Western Australian Museum 
in Perth and the expedition’s chief scientist. 
Kaycee Handley, a master’s student at
Macquarie University in Sydney, is sitting
in rapt attention. The Falkor’s remote 
submersible had just spotted a massive 
siphonophore, a colonial organism related 
to corals, sea anemones and jellyfish. We’re 
still working on modelling the creature, but 
it’s clearly well over 100 metres long, much
longer than any animal previously recorded.


The screen shows the submersible’s 
robot arm using a suction tube to collect a 
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sample from the siphonophore. We're still 
sequencing the DNA to confirm the species. 
The description of the organism will be the 
first paper to come out of this cruise. 

I also found my worms, including a
new species of ‘green bomber’ that sheds 
glowing spheres to distract predators, 
and a new species of squid worm (genus
Teuthidodrilus), so named because it has ten 
‘tentacles’, including two for feeding. 

As an Australian working in California, it
was gratifying to be doing fieldwork in my 
home country. The oceans off Australia are an 
undiscovered world, and I'd hoped to be on 
the entire month-long cruise. But I had to go 
back to my laboratory halfway through, when 
the coronavirus began interrupting travel. 

The Falkor is an amazing ship with 
an amazing crew. It offers a wonderful 
combination of science and outreach. I felt 
privileged to be on board. 


Greg Rouse is a marine biologist at the Scripps 
Institution of Oceanography at the University 
of California, San Diego. Interview by
Chris Woolston.
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ARISING FROM I. Vandewauw et al. Nature https://doi.org/10.1038/nature26137 (2018) 


The molecular mechanisms that underlie the distinct sensations of 
warmth and heat have remained unclear until recently. In 2016 we 
showed that the transient receptor potential (TRP) ion channel TRPM2 
mediates the sensation of warmth in vivo in mice¹, whereas in 2018 Vandewauw
et al.² found that three channels (TRPV1, TRPM3 and TRPA1)
jointly mediate the sensation of painful heat. However, there was a 
discrepancy between the studies: we found that TRPM2 contributed 
to the thermal response of isolated somatosensory neurons!”, whereas
Vandewauw et al.² proposed that TRPA1, and not TRPM2, was the only
thermally activated ion channel in somatosensory neurons apart from 
TRPV1 and TRPM3. Here we show that both TRPM2 and TRPA1, in addition
to the established thermosensory ion channels TRPV1 and TRPM3,
drive responses of sensory neurons to increases of temperature. 

In our original paper¹ we used calcium imaging to show that around
10% of somatosensory neurons from the dorsal root ganglia (DRG) were 
activated by heat but did not express either functional TRPV1 or TRPM3, 
and we proposed that the thermal response in these neurons was driven 
by TRPM2. Here we use a similar method to test whether expression of 
TRPA1 could instead be the cause of thermal responses in these neurons 
(Fig. 1a; for the protocol of solution application see Fig. 2a), as proposed by 
Vandewauw et al.². Figure 1a shows that responses to the selective TRPA1
agonist allyl isothiocyanate (AITC) are observed in around 25% of all DRG
neurons, almost all of which (24%) also express TRPV1 and/or TRPM3, in
agreement with previous studies*>. A similar result was obtained in neurons
from the trigeminal ganglia (TG)—which innervate the face and head—in 
which 41.5% in total express TRPA1, of which only 2.5% express TRPA1 in
the absence of both TRPV1 and TRPM3 (Extended Data Fig. 1a). Notably, 
around 10% of DRG neurons, and 8% of TG neurons, were activated by heat 
but did not express functional TRPA1, TRPV1 nor TRPM3, similar to the
proportion of TRPM2-expressing neurons identified in our original study¹.

Figure 1b shows that genetic deletion of Trpm2 reduced the proportion
of neurons in which thermal responses are not attributable to
any of TRPV1, TRPM3 or TRPA1 from 10% to 3%, without significantly 
changing the proportions of neurons in other classes; this is consistent 
with TRPM2 driving the majority of thermal responses that are independent
of TRPV1, TRPM3 and TRPA1. Apart from neurons expressing
TRPM2 in isolation, thermal responses attributable to TRPM2 are also
co-expressed with TRPV1 and TRPM3. Figure 1c confirms this by showing
that 38% of neurons remain thermally activated when responses mediated
by TRPV1, TRPM3 and TRPA1 are blocked by selective antagonists.
Genetic deletion of Trpm2 reduces the proportion of neurons responding
to heat when TRPV1, TRPM3 and TRPA1 are blocked from around 38%
to around 7% (Fig. 1d). The small remaining proportion of unidentified
heat-responsive neurons generate responses that are not attributable to
any of TRPV1, TRPM3, TRPA1 or TRPM2. The mean response amplitude
in wild-type neurons in the presence of inhibitors of TRPV1, TRPM3 and
TRPA1 (Fig. 1c) was ΔF340/F380 = 1.90 ± 0.06 (where F340/F380 is the ratio
of fluorescence intensities excited at 340 nm and 380 nm), whereas
for unidentified heat responses (Fig. 1d) the response amplitude was
ΔF340/F380 = 1.59 ± 0.16 (Fig. 1e). The unidentified heat response may be due
to the expression of an additional heat-sensitive mechanism—perhaps 
involving the anion channel Anoctamin 1°—although the possibility that 
it may be due to a non-specific response to heat cannot be excluded.

An alternative approach to determining the role of TRPM2 is to use a
selective pharmacological blocker. We chose 2-aminoethoxydiphenyl 
borate (2-APB), which largely blocks TRPM2 at a concentration’ of 
25 µM, although higher concentrations activate members of the TRPV
family®”°. Figure 2a, b shows that the response amplitude in thermally 
activated neurons with responses that are not attributable to TRPV1, 
TRPM3 or TRPA1 is strongly inhibited by 2-APB. When neurons from
Trpm2−/− mice were tested in the same way (Fig. 2c), there was no significant
effect of 2-APB on the amplitude of the small number of ‘unidentified’
thermal responses.

In neurons responding to only one of the agonists for TRPV1, TRPM3
or TRPA1 (Extended Data Fig. 2a), 2-APB caused a small reduction in 
thermal response amplitude (not significant in the case of the very 
small number of neurons expressing only TRPA1 in isolation). Because
many neurons co-express TRPM2 with other TRP channels (Fig. 1c, d) 
it is likely that this reduction in response is due to an inhibitory effect 
of 2-APB on co-expressed TRPM2. Extended Data Fig. 2b supports this 
possibility, because in neurons from Trpm2−/− mice there was either
no change in response amplitude upon application of 2-APB, or an 
increase in response amplitude in the case of TRPV1, which is known 
to be activated by 2-APB*°. 

These experiments show that the responses of the TRPM2-expressing 
neurons identified in our original study¹ are not due to the expression of
TRPA1, as was proposed by Vandewauw et al.². Thermal responses that
are not attributable to TRPV1, TRPM3 or TRPA1 are almost completely
eliminated by either genetic deletion of Trpm2 or pharmacological
block of TRPM2, confirming our original proposal that these neurons
express TRPM2¹.

We sought to identify the cause of the discrepancy between the 
above results and the work of Vandewauw et al.². Changing the source
of neurons (DRG or TG), the order of application of agonists and the 
heat stimulus, the culture conditions, the rise time of heat application 
and the starting temperature did not explain the discrepancy (Extended 
Data Fig. 1). However, terminating the temperature ramp at a lower level 
of 45 °C (as used by Vandewauw et al.²) rather than at the temperature
of 48 °C used in our experiments, sharply reduced the proportion of
heat-sensitive neurons with responses that are not attributable to
TRPV1, TRPM3 or TRPA1 (Fig. 2a, d, e). This suggests that the reason
why Vandewauw et al.² did not identify thermally activated neurons
expressing TRPM2 was that their heat stimulus did not reach a sufficiently
high temperature to activate TRPM2 in these isolated neurons.


¹Wolfson Centre for Age-Related Diseases, King’s College London, London, UK. ²Department of Neurology, Kaohsiung Medical University Hospital, Kaohsiung Medical University, Kaohsiung,
Taiwan. ³Graduate Institute of Clinical Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung, Taiwan. e-mail: peter.mcnaughton@kcl.ac.uk
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Fig. 1| A fraction of thermally activated somatosensory neurons does not 
express TRPV1, TRPM3 or TRPA1. a, Proportions of DRG neurons expressing
TRPV1, TRPM3 and TRPA1, and of heat-sensitive neurons expressing none of
these TRP channels, obtained from increases in F340/F380 in response to the
application of selective agonists (see Methods for details). The experimental
protocol is shown in Fig. 2a (except that TRPM2 blocker 2-APB was not applied).
The pie chart shows mean data from 3 experiments as follows: neurons
expressing only TRPV1 (19.6 ± 2.0%, mean ± s.e.m.); only TRPM3 (13.5 ± 0.2%);
both TRPV1 and TRPM3 but not TRPA1 (16.2 ± 3.5%); TRPA1 together with either
TRPV1 or TRPM3 (24.4 ± 1.6%); TRPA1 alone and responding to heat (0.2 ± 0.1%);
TRPA1 alone and not responding to heat (0.5 ± 0.2%); heat-sensitive neurons
not responding to agonists for any of TRPV1, TRPM3 or TRPA1 (9.5 ± 0.2%); and
neurons not responding to heat nor to any of the TRP agonists but identified as
viable from their response to a final pulse of KCl (16.1 ± 2.6%). See criteria
in Methods for the identification of positive heat and agonist responses. Data
from n = 1,563 neurons, 13 coverslips from 3 wild-type mice. b, Similar
experiment performed on DRG neurons from Trpm2−/− mice. There is no
significant difference in the proportions of any of the agonist-sensitive
neurons compared with the wild type (P > 0.05, multiple t-test with Bonferroni
correction). However, the proportion of heat-sensitive neurons with responses


The deletion of Trpm2 alters the behavioural sensation of warmth
in vivo¹, but does not affect thermal sensation in the range of noxious
heat, at temperatures of 42 °C and greater’?". In addition, noxious 
heat sensation is abolished when Trpv1, Trpm3 and Trpa1 (but not
Trpm2) are deleted². These observations suggest that TRPM2 is activated
in the range of 30 °C to 40 °C in vivo. However, the experiments
described here show that, following neuronal isolation, the tempera- 
ture threshold increases into the noxious temperature range, as is also 
found in expression systems”. A possible explanation is that a factor 
regulating the thermal sensitivity of TRPM2 may be lost following 
neuronal isolation. 

Finally, we tested whether somatosensory neurons expressing TRPA1 
can respond to thermal stimuli, as proposed by Vandewauw et al.?. 
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that are not attributable to TRPV1, TRPM3 or TRPA1 is significantly reduced by
deletion of Trpm2, from 9.5 ± 0.2% in the wild type to 2.5 ± 0.8% in Trpm2−/− mice
(P = 0.009, multiple t-test with Bonferroni correction). Data from n = 660
neurons, 14 coverslips from 3 Trpm2−/− mice. c, Proportion of DRG neurons
responding to heat in the presence of an inhibitor cocktail blocking TRPV1,
TRPM3 and TRPA1. See Extended Data Fig. 1c, top trace, for protocol.
The mean percentage of heat-responsive neurons is 38.2 ± 4.4% and
ΔF340/F380 = 1.90 ± 0.06 (mean ± s.e.m.). Data from n = 1,315 neurons, 4 wild-type
mice. d, Similar experiment to that shown in c, but on DRG neurons from
Trpm2−/− mice. The mean percentage of ‘unidentified’ neurons with heat
responses that are not attributable to TRPV1, TRPM3, TRPA1 nor TRPM2 was
6.9 ± 2.2% (mean ± s.e.m., significant difference from wild type, P = 0.0007,
unpaired t-test) with ΔF340/F380 = 1.59 ± 0.16 (mean ± s.e.m., significant
difference from wild type, P = 0.0237, unpaired t-test). Data from n = 1,794
neurons, 4 Trpm2−/− mice. e, Amplitudes of ΔF340/F380 in response to heat ramp
from the experiments shown in c (wild-type neurons) and d (Trpm2−/− neurons).
The threshold for accepting a significant heat response is shown by the vertical
dotted line, which gives the intrinsic heat-sensitivity of the fura2 dye obtained
from a heat ramp applied in a Ca-free solution (see details in Methods). Mean
values of ΔF340/F380 for neurons exceeding the threshold are given in c and d.


We tested the heat sensitivity of identified TRPA1-expressing neurons 
from Trpm2−/− mice, in which TRPV1 and TRPM3 were pharmacologically
blocked (Extended Data Fig. 3). We found that around 30% of all neu- 
rons were activated by a TRPA1 agonist—similar to the proportion of 
TRPA1-expressing neurons in the experiments shown in Fig. 1. In 34.7% 
of these TRPA1⁺ neurons we observed a response to heat that was inhibited
by blocking TRPA1. In a further 5.5% of the total number of neurons,
the response to heat was not due to any of TRPM2, TRPV1, TRPM3 nor 
TRPA1 (unidentified heat-responders, see also Fig. 1d). We conclude that
TRPA1—often assumed to be activated only by cold’—does contribute to 
heat responses in somatosensory neurons, as reported by Vandewauw 
et al.². In summary, there is clear evidence that TRPV1, TRPM3, TRPM2 and
TRPA1 contribute to the thermal responses of somatosensory neurons,
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Fig. 2 | Pharmacological block of TRPM2 inhibits the heat response of 
heat-sensitive neurons not expressing TRPV1, TRPM3 or TRPA1. a, Effect of
pharmacological block of TRPM2 on identified somatosensory neurons.
Neurons expressing TRPV1, TRPM3 and/or TRPA1 were identified by brief
application of the specific agonists capsaicin (Caps, 1 µM), pregnenolone
sulfate (PS, 100 µM) and AITC (50 µM), respectively, followed by a heat ramp to
48 °C to activate all heat-sensitive neurons. The effect of the TRPM2 blocker
2-APB (25 µM) on heat-activated neurons was tested, followed by a pulse of KCl
(50 mM) to test the viability of all neurons. At the end, the heat pulse was
repeated in Ca-free solution to identify changes in F340/F380 (ΔF340/F380) arising
from sensitivity of the fura2 dye to temperature. The example trace shows that
the response to heat in a neuron not expressing any of TRPV1, TRPM3 or TRPA1
is strongly suppressed by the TRPM2 blocker 2-APB (25 µM). b, The increase in
calcium-dependent fluorescence ratio F340/F380 in response to heat in
heat-sensitive neurons with responses that are not attributable to TRPV1,
TRPM3 or TRPA1 is strongly suppressed by 2-APB (25 µM) (n = 149 neurons from
3 wild-type mice). The increase in F340/F380 is corrected for each neuron by


and that a further unidentified heat sensor is present in a small minority
of neurons. 


Methods 
Mice 
Male mice on a C57BL/6J background, 9–18 weeks old, were used for all
experiments. Trpm2−/− mice were a gift from Y. Mori. Mice were maintained
on a 12 h light/12 h dark cycle.
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subtracting the increase in F340/F380 measured from the heat ramp applied in
Ca-free solution (see Fig. 2a). HBSS, Hank’s balanced salt solution. c, Similar
experiment on Trpm2−/− neurons. The few heat-sensitive neurons with
responses that are not attributable to TRPV1, TRPM3 or TRPA1 (Fig. 1b) show no
significant effect of 2-APB on response amplitude (n = 20 neurons from 3
Trpm2−/− mice). d, The percentage of responding neurons activated at 48 °C
(ref. 1) or at an earlier time point when the temperature had reached 45 °C (ref. 2),
as shown in a. The lower temperature reduced the proportion of heat-sensitive
neurons with responses that are not attributable to TRPV1, TRPM3 or TRPA1
from 9.5 ± 0.2% in response to heating to 48 °C to 2.2 ± 0.2% in response to
heating to 45 °C (P < 0.0001, unpaired t-test). n = 1,563 neurons from 3 wild-type
mice. e, Amplitudes of ΔF340/F380 in response to heat ramp reaching 48 °C and
45 °C, from experiments shown in a. The threshold for designating a heat
response is shown by vertical dotted lines, which show the intrinsic heat
sensitivity of the fura2 dye obtained from heat ramp applied in a Ca-free solution
(see details in Methods). The mean ΔF340/F380 in response to 48 °C was 2.04 ± 0.11
and to 45 °C was 0.67 ± 0.03 (n = 149 neurons, P < 0.0001, unpaired t-test).


Primary neuron cultures 

Mice were euthanized by cervical dislocation, after which DRG or TG 
ganglia were excised then incubated in papain (2 mg ml⁻¹ in Ca²⁺-free
and Mg²⁺-free HBSS) for 30 min at 37 °C, followed by incubation in
collagenase (2.5 mg ml⁻¹ in Ca²⁺-free and Mg²⁺-free HBSS) for 30 min
at 37 °C. Ganglia were re-suspended and mechanically dissociated 
in Neurobasal-A/B27 growing medium, which was prepared with 
Neurobasal-A medium supplemented with 0.25% (v/v) L-glutamine 
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200 mM (Invitrogen), 2% (v/v) B-27 supplement (Invitrogen), 1% (v/v) 
penicillin/streptomycin (Invitrogen) and nerve growth factor (NGF) 
(Sigma-Aldrich) at 50 ng ml⁻¹ unless otherwise specified in the figure
legends. Dissociated neurons were centrifuged and plated onto coverslips
pre-coated with poly-L-lysine (10 µg ml⁻¹) and laminin (40 µg ml⁻¹).
Neurons were kept in a 37 °C incubator with a 95% air/5% CO₂ atmosphere
for at least 12 h before use, and all neurons were used within 24 h.


Extracellular solutions and perfusion system 

Unless otherwise specified, all experiments were carried out with an extracellular
solution containing 140 mM NaCl, 4 mM KCl, 1.8 mM CaCl₂, 1 mM
MgCl₂, 10 mM HEPES and 5 mM glucose; the pH was adjusted to 7.4 with
NaOH and the osmolarity was between 295 and 310 mOsm. Calcium-free extracellular
solution was prepared with the formulation above except for the
omission of calcium chloride. An eight-line manifold gravity-driven system
controlled by an automated solution changer with a common outlet was
used to apply solution to the cells. The solution was heated with a Peltier 
device regulated by a proportional gain feedback controller designed by 
V. Vellani (CV Scientific). In separate control experiments, the temperature 
in each experimental protocol was recorded daily by a miniature ther- 
mocouple at the cell location. All compounds applied were prepared as 
stock solutions first and then diluted to the concentration needed before 
experiments. Drugs used were 1 µM capsaicin (5 mM stock in ethanol), 100 µM
pregnenolone sulfate (500 mM stock in DMSO), 25 µM 2-APB (500 mM
stock in DMSO), 50 µM AITC (200 mM stock in DMSO), 100 µM HC030031
(100 mM stock in DMSO), 10 µM naringenin (100 mM stock in DMSO) and 5 µM
AMG9810 (50 mM stock in DMSO). The inhibitor cocktail in Fig. 1c–e and
Extended Data Fig. 1c was composed of the TRPA1 antagonist HC030031,
the TRPV1 antagonist AMG9810 and the TRPM3 antagonist naringenin. A 
similar inhibitor cocktail but with the TRPA1 antagonist omitted for part 
of the experiment was used in Extended Data Fig. 3. 
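
As a rough illustration of the stock dilutions listed above, the short Python sketch below applies C1·V1 = C2·V2 to four of the drugs. Only the stock and working concentrations are taken from the list; the 50 ml batch volume, the function names and the choice of drugs are assumptions made for this example and are not part of the Methods.

```python
# Stock (mM) and working (µM) concentrations as listed in the text.
DRUGS = {
    "capsaicin": (5.0, 1.0),      # stock in ethanol
    "AITC": (200.0, 50.0),        # stock in DMSO
    "HC030031": (100.0, 100.0),   # stock in DMSO
    "naringenin": (100.0, 10.0),  # stock in DMSO
}


def dilution_factor(stock_mM, working_uM):
    """Fold dilution from stock to working concentration (1 mM = 1000 µM)."""
    return stock_mM * 1000.0 / working_uM


def stock_volume_ul(stock_mM, working_uM, final_volume_ml):
    """Stock volume (µl) for a given final volume (ml), from V1 = C2*V2/C1."""
    return final_volume_ml * 1000.0 / dilution_factor(stock_mM, working_uM)


def vehicle_percent(stock_mM, working_uM):
    """Final solvent (ethanol or DMSO) content in % (v/v)."""
    return 100.0 / dilution_factor(stock_mM, working_uM)


if __name__ == "__main__":
    final_ml = 50.0  # hypothetical batch volume, not stated in the Methods
    for name, (stock, working) in DRUGS.items():
        print(
            name,
            dilution_factor(stock, working),
            stock_volume_ul(stock, working, final_ml),
            vehicle_percent(stock, working),
        )
```

For capsaicin, for instance, a 5 mM stock used at 1 µM corresponds to a 5,000-fold dilution; the same helper also gives the final solvent fraction, which may be worth checking when several DMSO stocks are combined in one cocktail.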


Calcium imaging 

Cells were loaded with 5 µM fura-2-AM (Invitrogen) with 0.02% (v/v)
pluronic acid (Invitrogen) in Neurobasal-A medium for 30 min at 37 °C. 
After loading, coverslips were placed in an imaging chamber in extracel- 
lular solution as above and transferred to a Nikon Eclipse Ti-E inverted 
microscope. Cells were continuously perfused with extracellular solu- 
tion and were illuminated with a monochromator alternating between 
340 nm and 380 nm (OptoScan; Cairn Research), controlled by WinFluor 
3.2 software (J. Dempster, University of Strathclyde, UK). Emission was 
collected at 510 nm and the resulting pairs of images were acquired every 
two seconds with a 100-ms exposure time using an iXon 897 EM-CCD
camera (Andor Technology). Image time series were converted to TIFF 
files and processed with ImageJ software. Images of the background 
fluorescence intensity were obtained for both wavelengths and subtracted 
from the respective image stack before calculating the F340/F380 
images. A minority of neurons exhibited an unstable F340/F380 baseline in 
the absence of any applied stimulus (usually caused by poor dye loading 
but in some cases apparently due to low-frequency repetitive firing 
even in the absence of any treatment) and were removed from analysis. 
A positive response to an agonist was defined as an increase in the 
F340/F380 ratio between two consecutive time points after application 
of the agonist that exceeded the mean + 3.09 s.d. (cumulative 
probability value of 99.9%) of all such differences in the absence of 
any agonist. A heat-sensitive neuron is defined as a 
neuron with a peak increase in F340/F380 during a heat stimulus that is 
larger than the mean + 3.09 s.d. of the peak increase in F340/F380 during 
a heat stimulus in Ca-free solution in the same experiment (see Fig. 2a). 
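
The ratio calculation and the agonist response criterion described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' analysis code: the function names are hypothetical, the baseline differences are taken from a single neuron's pre-agonist trace (the text does not say whether differences were pooled across neurons), and the sample standard deviation is used.

```python
from statistics import mean, stdev

Z_999 = 3.09  # z-score corresponding to a cumulative probability of 99.9%


def ratio_trace(f340, f380, bg340, bg380):
    """Background-subtracted F340/F380 ratio at each time point."""
    return [(a - bg340) / (b - bg380) for a, b in zip(f340, f380)]


def consecutive_differences(trace):
    """Change in ratio between each pair of consecutive time points."""
    return [later - earlier for earlier, later in zip(trace, trace[1:])]


def is_responder(baseline, after_agonist):
    """True if any consecutive-point increase after agonist application
    exceeds mean + 3.09 s.d. of the differences seen without agonist.

    Assumption: `baseline` is this neuron's own agonist-free ratio trace.
    """
    base = consecutive_differences(baseline)
    threshold = mean(base) + Z_999 * stdev(base)
    return max(consecutive_differences(after_agonist)) > threshold


# Hypothetical ratio values, for illustration only
baseline = [0.50, 0.51, 0.50, 0.52, 0.51, 0.50]
print(is_responder(baseline, [0.50, 0.52, 0.70, 0.95]))  # clear rise
print(is_responder(baseline, [0.50, 0.51, 0.50]))        # flat trace
```

The heat-sensitivity criterion has the same form, with the peak increase during the heat stimulus compared against the mean + 3.09 s.d. of peak increases measured in Ca-free solution.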


Statistical analysis 
All data are expressed as mean ± s.e.m. Statistical analyses were 
performed with GraphPad Prism version 7.04 and data were tested for 
approximation to normality. All statistical tests were two-sided. All 
measurements were taken from distinct samples. The statistical test 
used is stated in each figure legend. 
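
For readers reproducing the summary statistics outside GraphPad Prism, the mean ± s.e.m. convention used throughout can be computed as in the sketch below. The function name and the example values are hypothetical, and the sample standard deviation (n - 1 denominator) is assumed.

```python
from math import sqrt
from statistics import mean, stdev


def mean_sem(values):
    """Return (mean, standard error of the mean) for a list of measurements.

    s.e.m. = sample standard deviation / sqrt(n).
    """
    n = len(values)
    return mean(values), stdev(values) / sqrt(n)


# Hypothetical percentages from three independent experiments
m, sem = mean_sem([8.0, 10.0, 12.0])
print(f"{m:.1f} ± {sem:.2f}%")
```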


Ethical approval 

Ethical approval was given by the Animal Welfare Ethical Review Board, 
King’s College London and by the Home Office, UK Government. In the 
case of the experiment shown in Extended Data Fig. 1a, approval was 
given by the Institutional Animal Care and Use Committee, Kaohsiung 
Medical University. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


All data are available from the corresponding author upon reasonable 
request. 
Extended Data Fig. 1 | Changing the source of neurons, the order of 
application of agonists and the heat stimulus, the culture conditions, the 
rise time of heat application and the starting temperature do not 
significantly affect the proportions of heat-sensitive neurons with 
responses that are not attributable to TRPV1, TRPM3 or TRPA1. a, Neurons 
from the TG were tested for heat and agonist sensitivity. Left, percentage 
distributions of heat-sensitive and agonist-sensitive neurons. TRPA1 is strongly 
co-expressed with TRPV1 or TRPM3 (39%), with few neurons expressing TRPA1 
alone (2.5%). Heat-sensitive neurons that respond to heat but do not express 
any of TRPV1, TRPM3 or TRPA1 can be clearly identified (8%). n = 388 neurons 
from 1 wild-type mouse. Right, similar experiment on n = 382 TG neurons from 1 
Trpm2−/− mouse. The proportion of heat-sensitive neurons with responses that 
are not attributable to TRPV1, TRPM3 or TRPA1 is significantly reduced 
compared with the wild type (from 8% to 3%, P = 0.0077, Fisher’s exact test). 
The mean response amplitude of heat-sensitive neurons with responses that 
are not attributable to TRPV1, TRPM3 or TRPA1 was not significantly 
reduced by deletion of Trpm2 (ΔF340/F380 = 1.13 ± 0.05, n = 31 for wild type to 
0.96 ± 0.08, n = 13 for Trpm2−/−, difference not significant, P = 0.0539, unpaired 
t-test). b, Neurons from the DRG were tested for heat and agonist sensitivity 
using the protocol shown in Fig. 2a, but without the application of 2-APB and 
with agonists presented after heat rather than before. Left, neurons were 
cultured for 12 h in NGF. See Methods for details. The pie chart shows data 
from 3 experiments on a total of 1,312 neurons from 3 wild-type mice on 
12 coverslips as follows: neurons expressing only TRPV1 (32.2 ± 4.2%, 
mean ± s.e.m.); only TRPM3 (8.2 ± 1.2%); both TRPV1 and TRPM3 but not TRPA1 
(13.3 ± 0.6%); TRPA1 together with either TRPV1 or TRPM3 (13.3 ± 0.9%); TRPA1 
alone and responding to heat (0.2 ± 0.2%); TRPA1 alone and not responding to 
heat (0.4 ± 0.2%); heat-sensitive neurons responding to heat but not to agonists 
for any of TRPV1, TRPM3 or TRPA1 (7.6 ± 2.0%); and neurons not responding to 
heat nor to any of the TRP agonists but that were identified as viable from their 
response to a final pulse of KCl (24.8 ± 1.8%). The application of agonists before 
(Fig. 1a) or after the heat pulse (this figure) does not significantly affect the 
proportion of heat-sensitive neurons with responses that are not attributable 
to TRPV1, TRPM3 or TRPA1 (9.5 ± 0.2% when agonists are applied before (Fig. 1a) 
and 7.6 ± 2.0% when applied after, P = 0.3740, unpaired t-test). The proportions 
of neurons in other categories are also not significantly affected, apart from a 
small reduction in the proportion of neurons expressing TRPA1 in combination 
with TRPV1 and/or TRPM3 (compare with Fig. 1a). Right, neurons cultured for 
12 h in medium containing glial cell line-derived neurotrophic factor (GDNF) at 
2 ng ml−1 and neurotrophin 4 (NT4) at 10 ng ml−1, the same culture protocol used 
by Vandewauw et al.”. n = 1,521 neurons from 3 wild-type mice on 12 coverslips. 
The altered culture conditions had no significant effect on the proportions of 
heat-sensitive and agonist-sensitive neurons (P > 0.05, multiple t-test with 
Bonferroni correction). c, Left, protocols for determining the total proportion 
of DRG neurons in which heat responses are partly driven by a mechanism 
independent of TRPV1, TRPM3 or TRPA1. The heat ramp is first applied in a 
Ca-free solution, to identify changes in the F340/F380 fluorescence ratio (ΔF340/F380) 
arising from sensitivity of the fura-2 dye to temperature, followed by an 
identical ramp in the presence of a cocktail of inhibitors of TRPV1 (AMG9810, 
5 µM), TRPM3 (naringenin, 10 µM) and TRPA1 (HC-030031, 100 µM) (see details 
in Methods). A final pulse of KCl confirms viability of the neurons. Right, top: 
heat application in which the temperature was increased from 33 °C to 48 °C 
over 180 s (ref.'). n = 710 neurons from 2 wild-type mice on 3 coverslips. Middle, 
heat application in which the temperature was increased from 33 °C to 48 °C 
over 25 s (ref.”). n = 99 neurons from 1 wild-type mouse on 2 coverslips. Bottom, 
heat application in which the temperature was increased from 23 °C to 48 °C 
over 180 s (ref.”). n = 657 neurons from 1 wild-type mouse on 2 coverslips. 
Reducing the rise time of heat application or the starting temperature does not 
reduce the total proportion of heat-sensitive neurons with responses that are 
not attributable to TRPV1, TRPM3 or TRPA1. Differences not significant, 
P = 0.9197 (middle) and 0.3339 (bottom) respectively, unpaired t-test compared 
to top panel. 
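
The comparison of proportions in panel a (Fisher’s exact test) can be illustrated with a small standard-library Python sketch. The counts used here, 31 of 388 wild-type neurons and 13 of 382 Trpm2−/− neurons, are an assumption: they are taken from the n values quoted for the response amplitudes in panel a and are consistent with the stated 8% and 3%, but the caption does not give the contingency table explicitly. The function name is hypothetical, and the two-sided P value is computed by summing the probabilities of all tables that are no more likely than the observed one.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact P value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    total = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of observing x in the top-left cell
        return comb(row1, x) * comb(row2, col1 - x) / total

    p_observed = prob(a)
    lowest, highest = max(0, col1 - row2), min(row1, col1)
    # Sum every table at least as extreme (no more probable) than the observed one;
    # the small tolerance guards against floating-point ties.
    return sum(
        p for p in (prob(x) for x in range(lowest, highest + 1))
        if p <= p_observed * (1 + 1e-9)
    )


# Assumed counts: heat-only responders vs all other neurons
wild_type = (31, 388 - 31)
trpm2_ko = (13, 382 - 13)
p_value = fisher_exact_two_sided(*wild_type, *trpm2_ko)
print(f"P = {p_value:.4f}")
```

If the assumed counts match those used by the authors, the printed value can be compared directly with the reported P = 0.0077.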
Extended Data Fig. 2 | Effect of TRPM2 blocker 2-APB on heat responses in 
neurons expressing only one of TRPA1, TRPV1 or TRPM3. Neurons were 
identified as expressing only one of TRPA1 (left), TRPV1 (middle) or TRPM3 
(right) and the effect of 2-APB (25 µM) was tested as shown in Fig. 2a. 
a, Wild-type neurons. Note that many of these neurons will co-express TRPM2, 
which is extensively expressed in somatosensory neurons (Fig. 1c). b, Similar 
experiments on Trpm2−/− neurons. Significance level during 2-APB application 
compared to that after wash is as follows: *P < 0.05; **P < 0.01; ***P < 0.001; NS, 
P > 0.05. One-way ANOVA followed by Bonferroni's post hoc correction. 
Extended Data Fig. 3 | A fraction of TRPA1+ neurons from Trpm2−/− mice 
responds to heat when TRPV1 and TRPM3 are blocked. a, Top trace shows 
protocol of solution changes used to identify neurons expressing TRPA1 and to 
test their activation by heat. TRPV1 and TRPM3 were blocked by AMG9810 
(5 µM) and naringenin (10 µM) as shown. The calcium increase elicited by a heat 
ramp from 33 °C to 48 °C was tested in the absence (heat ramps 1 and 3) and the 
presence (ramp 2) of the TRPA1 blocker HC030031 (100 µM). Responses to the 
TRPA1-selective agonist AITC were tested as shown. The response to KCl 
(50 mM) and to the heat ramp in the absence of external Ca was tested at the 
end. Middle trace shows increases in F340/F380 in a neuron expressing TRPA1 in 
which the heat-activated Ca increase was suppressed by the TRPA1 blocker 
HC030031. TRPA1 was expressed in 31.2 ± 4.2% of the total population, but only 
a fraction of TRPA1+ neurons (34.7% ± 9.81%) gave a significant Ca increase in 
response to 48 °C heat. Lower trace shows a different neuron in the same 
preparation that is activated by heat but does not express TRPA1 (unidentified 
heat-responders). Representative data from 3 separate experiments on a total 
of 964 neurons. b, Heat response in TRPA1-expressing neurons is suppressed 
by the TRPA1 blocker HC030031 (see a for protocol (top) and example trace 
(middle)). ***P < 0.001, one-way ANOVA followed by Bonferroni's post hoc 
correction. c, Heat response in neurons not expressing TRPA1 (unidentified 
heat-responsive neurons, see a for example trace (bottom)) is not significantly 
reduced by the TRPA1 blocker HC030031 (100 µM). The proportion of 
unidentified heat-responders (5.5% ± 1.4%) is not significantly different from 
that in Fig. 1d. 
Data collection Calcium imaging data were collected using WinFluor 3.2 software (J. Dempster, University of Strathclyde, UK). Image time series were 
converted to TIFF files and processed with ImageJ software. This is stated in Methods. 
Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size Sample sizes were determined from prior experience and were typically n > 1,000 neurons from n ≥ 3 animals. In Extended Data Fig. 1, 
smaller numbers of neurons and animals were used, but the numbers were always sufficient to give adequate statistical power. 


Data exclusions Cells that did not respond to a depolarizing solution (see Fig. 2A) were excluded from analysis. A minority of neurons (c. 2%) exhibited an 
unstable F340/380 baseline in the absence of any applied stimulus, usually caused by poor dye loading but in some cases apparently due to 
low-frequency repetitive firing even in the absence of any treatment, and were also removed from analysis. This is stated in Methods. 


Replication Individual experiments were conducted on neurons in a single field of view on a single cover slip (typically c. 100 neurons in a single field of 
view), isolated from a single animal. Experiments were repeated on approx 4 cover slips from each single animal. To control for possible 
variability between animals this procedure was repeated on typically 3 animals (further details under "Sample Size" above). 
Randomization Not relevant as all animals were of the same strain (C57BL/6J, either WT or TRPM2-/-), were adult (9-18 weeks) and were raised under 
identical conditions. Stated in Methods. 


Blinding Data collection was automated so blinding was not necessary. The only exception was the exclusion of a small number of neurons exhibiting 
an unstable baseline, see above, which was judged manually. The proportions of these unstable neurons were similar in different treatment 
groups. Baseline instability was obvious and there is no possibility that results could have been skewed by biased omission of more neurons 
from certain treatment groups. 
Laboratory animals Male mice on a C57BL/6J background, 9-18 weeks old, were used for all experiments. Stated in Methods. 

Wild animals The study did not involve wild animals. 

Field-collected samples The study did not involve samples collected from the field. 

Ethics oversight Ethical approval was given by the Animal Welfare Ethical Review Board, King's College London and by the Home Office, UK 
Government, or in the case of the experiment shown in Extended Data Fig. 1A, by the Institutional Animal Care and Use 
Committee, Kaohsiung Medical University. Stated in Methods. 


REPLYING TO B. Vilar et al. Nature https://doi.org/10.1038/s41586-020-2510-7 (2020) 


In a recent study¹, we demonstrated that acute noxious heat sensing in 
mice depends on a trio of redundant heat-activated transient receptor 
potential (TRP) ion channels: TRPV1, TRPM3 and TRPA1. In the 
accompanying Comment², Vilar et al. do not challenge the findings of 
our original study but refer to the role of another heat-activated TRP 
channel, TRPM2, in mediating heat responses in isolated somatosensory 
neurons. In an earlier study³, Tan and McNaughton had proposed 
that TRPM2 underlies TRPM3- and TRPV1-independent heat responses 
in isolated neurons and mediates warmth sensing, whereas findings 
reported in our study did not endorse this conclusion. Here we provide 
results that reconcile the apparent discordance between these earlier 
studies. Overall, our data and those of Vilar et al.² indicate that TRPM2 
can mediate heat responses in isolated sensory neurons, but primarily 
at temperatures greater than 45 °C. 

In 2016, Tan and McNaughton reported a new heat-sensitive mechanism 
in mouse sensory neurons, independent of the previously 
identified heat-sensitive TRP channels TRPV1 and TRPM3³. Neurons 
expressing this heat-sensing mechanism were activated by increasing 
temperatures over a range of temperatures between 34 °C and 48 °C 
(ref. 3). These heat responses were potentiated by H2O2 and mediated by 
a calcium-permeable cation channel, leading the authors to propose the 
redox-sensitive channel TRPM2 as a molecular candidate³. In support 
thereof, Tan and McNaughton found a lower proportion of TRPM3- 
and TRPV1-independent heat-responsive neurons in TRPM2-deficient 
mice. Because these mice also exhibited a deficit in the avoidance of 
non-noxious warm temperatures, they concluded that TRPM2 acts as 
a warmth sensor in sensory neurons³. 

In our recent study, we demonstrated that acute noxious heat sensing 
in mice depends on a trio of TRP ion channels: TRPV1, TRPM3 and 
TRPA1. Using isolated sensory neurons, we showed robust responses to 
a noxious heat stimulus of 45 °C provided that at least one of these three 
TRP channels is functional¹. Notably, the combined elimination of all 
three TRP channels largely abolished sensory neuronal heat responses, 
and this was paralleled by loss of heat sensitivity in intact nerve 
terminals in the skin and a complete deficiency of acute heat-induced 
pain in vivo¹. We further reported that residual heat-induced currents 
and calcium responses in sensory neurons from Trpm3−/− Trpv1−/− 
double knockout (DKO) mice are potentiated by H2O2, but also that 
these responses are inhibited by the TRPA1 antagonist HC030031 
and are largely abolished in Trpv1−/− Trpm3−/− Trpa1−/− triple knockout 
(TKO) mice¹. We concluded that TRPM3- and TRPV1-independent, 
H2O2-stimulated heat responses are primarily mediated by TRPA1 
rather than by TRPM2¹. 

In the accompanying Comment, Vilar et al.² examine several possible 
reasons why both studies came to different conclusions regarding the 
contributions of TRPM2 and TRPA1 (refs. 1,3). First, we note that their data 
endorse the contribution of TRPA1 to TRPM3- and TRPV1-independent 
heat responses. Second, Vilar et al.² pinpoint a potential explanation for 
the discrepancies between the studies concerning the role of TRPM2 
in mediating heat responses, namely the stimulation temperature. In 
our original study¹, we used a 1-min heat pulse that reached a maximum 
temperature of 45 °C, whereas Tan and McNaughton used a slower 
temperature ramp with a maximum of 48 °C (ref. 3) after about 3 min. 
Vilar et al.² now report that activation of the TRPM2-dependent heat 
responses occurs mostly at temperatures greater than 45 °C. This 
observation differs quantitatively from the results in their earlier 
study, in which the majority of TRPM3- and TRPV1-independent heat 
responses reported thresholds between 34 °C and 44 °C (ref. 3), but is 
fully consistent with our earlier conclusion that TRPM2 in somatosensory 
neurons does not contribute significantly to responses to a 45 °C 
heat stimulus¹. 

To corroborate our earlier conclusion that TRPM2 is not the main 
mediator of TRPM3- and TRPV1-independent heat responses at temperatures 
up to 45 °C¹, we performed intracellular calcium-imaging experiments 
in sensory neurons isolated from the dorsal root ganglia (DRG) 
and trigeminal ganglia (TG) of DKO mice. Neurons were stimulated 
three times using our standard heat stimulus with a maximum at 45 °C, 
with the first and third heat stimuli performed in the presence of the 
TRPM2 antagonist 2-APB (250 µM), and the second application in the 
absence of this antagonist (Fig. 1a). A large subpopulation of both TG 
(47%) and DRG (34%) neurons showed robust heat responses to the first 
stimulation and gradually smaller responses to the second and third 
stimulus, most probably representing response rundown (Fig. 1a, b). 
The fact that these responses were not inhibited by 2-APB confirms that 
they are independent of TRPM2. Only a very small subset of neurons 
(less than 1% in TG and around 2.5% in DRG) exhibited a robust heat 
response in the absence, but not in the presence, of 2-APB (Fig. 1a, b). 
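
The classification logic implied by this three-stimulus protocol can be sketched as follows. This is a hypothetical illustration only: the function name, the use of boolean response flags (the criterion for a "robust" response is not restated here) and the rule that a response to either 2-APB-treated stimulus marks a neuron as 2-APB-insensitive are all assumptions, and the example flags are invented.

```python
from collections import Counter


def classify(resp_with_apb_1, resp_without_apb, resp_with_apb_2):
    """Classify one neuron from its responses to stimulus 1 (with 2-APB),
    stimulus 2 (without 2-APB) and stimulus 3 (with 2-APB)."""
    if resp_with_apb_1 or resp_with_apb_2:
        # Responds despite the TRPM2 antagonist being present
        return "2-APB-insensitive"
    if resp_without_apb:
        # Responds only when the antagonist is absent
        return "2-APB-sensitive"
    return "non-responder"


# Invented response flags for four neurons: (stimulus 1, stimulus 2, stimulus 3)
neurons = [
    (True, True, True),
    (False, True, False),
    (False, False, False),
    (True, True, False),   # e.g. rundown by the third stimulus
]
counts = Counter(classify(*flags) for flags in neurons)
for label, n in counts.items():
    print(f"{label}: {100 * n / len(neurons):.0f}%")
```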

Combined elimination of TRPV1, TRPM3 and TRPA1 in the TKO mice 
largely abolished neuronal responses to a 45 °C stimulus¹. To assess 
the proposal of Vilar et al.² that TRPM2 acts as a heat sensor at higher 
temperatures, we performed experiments in which we stimulated TKO 
neurons with a heat ramp reaching a maximum temperature of 50 °C 
(Fig. 2a). Indeed, we observed robust heat responses in around 32% 
of the TKO neurons when using this stronger stimulation protocol, and 
around 90% of these responses were reversibly inhibited by 25 µM 2-APB 
(Fig. 2a, b). Overall, these data are consistent with the conclusion of Vilar 
et al.² that TRPM2 contributes to heat responses in isolated sensory 
neurons, but primarily at temperatures greater than 45 °C. To better 
quantify differences in thermal dependence, we estimated thermal 
thresholds for sensory neurons exhibiting heat responses attributed 
either to TRPM2 or to TRPA1 (Fig. 2c). For the 2-APB-insensitive heat 
responses in DKO neurons, which are primarily mediated by TRPA1¹, 
we obtained an average threshold of 41.3 ± 0.4 °C (n = 37). In comparison, 
2-APB-sensitive heat responses in TKO neurons, which we attribute 
to TRPM2, exhibited a significantly higher threshold of 48.0 ± 0.2 °C 
(n = 111; P < 10°; Mann-Whitney test) (Fig. 2c). 

Taken together, the results in the accompanying Comment² and 
the results we present here and in our previous study¹ converge on a 
model whereby four different TRP channels (TRPV1, TRPM3, TRPA1 and 
TRPM2) are able to generate heat responses in isolated sensory neurons. 
Moreover, the small subset of 2-APB-insensitive heat-responsive TKO 
neurons (Fig. 2a, b) implies the existence of at least one additional 
heat sensor that is yet to be identified. Most importantly, the collective 
findings demonstrate that the relative contribution of TRPM2 
to the neuronal heat responses crucially depends on the stimulation 
temperature. 

TRPV1, TRPM3 and TRPA1 are functional in intact sensory nerve 
terminals in the skin, where they initiate pain responses to locally applied 
chemical agonists or noxious heat’*®. Mice that lack all three of these 
channels lack a heat-withdrawal reflex¹, indicating that TRPM2 does not 
incite heat pain signals in vivo. Notably, TRPM2-deficient mice reportedly 
have a deficit in warmth sensing, and largely lack the ability to 
discriminate between 33 °C and 38 °C (ref. 3). There is a disconnect of 
at least 10 °C between the temperature range in this behavioural assay 
and the thermal threshold of approximately 48 °C that is observed for 
TRPM2-dependent responses in isolated neurons. As pointed out by 
Vilar et al.², unidentified cellular factors that increase the sensitivity 
of TRPM2 to warmth may become lost during the process of collecting 
and culturing sensory neurons. The behavioural deficit in non-noxious 
warmth avoidance in TRPM2-knockout mice may also reflect the role 
of this channel in the preoptic area, a region of the hypothalamus that 
is involved in thermal homeostasis and behaviour’. 

Fig. 1 | TRPV1- and TRPM3-independent responses to a 45 °C heat stimulus are 
not inhibited by the TRPM2 antagonist 2-APB. a, Representative examples of 
responses of DKO DRG neurons to consecutive heat stimuli with a maximum 
temperature of 45 °C in the presence and absence of the TRPM2 antagonist 2-APB 
(250 µM, a supramaximal concentration to ensure full inhibition of TRPM2). 
[Ca2+]i, intracellular calcium concentration. b, Proportion of DKO TG neurons 
(n = 274 from 2 mice) and DRG neurons (n = 452 from 2 mice) that exhibit 
2-APB-sensitive and 2-APB-insensitive heat responses. Data in a, b are 
representative of six independent experiments. 

Fig. 2 | High-threshold 2-APB-sensitive heat responses in TKO sensory 
neurons. a, Responses of TKO DRG neurons to three consecutive heat ramps with 
a maximum temperature of 50 °C in the presence and absence of 2-APB (25 µM); 
shown are representative examples of a 2-APB-sensitive and a 2-APB-insensitive 
heat responder. b, Proportion of TKO DRG neurons (n = 266 from 2 mice) that 
exhibit 2-APB-sensitive and 2-APB-insensitive heat responses. c, Comparison of 
the thermal threshold of TRPA1-dependent heat responses (2-APB-insensitive 
heat responses in DKO neurons) and 2-APB-sensitive heat responders in TKO 
neurons (attributed to TRPM2). The solid line indicates the mean. 


Methods 


We used 10-14-week-old male Trpm3−/− Trpv1−/− (DKO) and 
Trpa1−/− Trpm3−/− Trpv1−/− (TKO) mice on a C57BL/6J background¹. Mice 
were housed in a specific-pathogen-free facility at 21 °C on a 12 h light/12 h 
dark cycle with unrestricted access to food and water. All experiments 
using mice were approved by the KU Leuven Ethical Committee Laboratory 
Animals under project number P265/2015. The procedures 
for culturing primary sensory neurons and for fura-2-based calcium 
imaging were identical to those in our original study¹. The thermal 
threshold was determined as the temperature at which the second 
time derivative of the intracellular calcium concentration reached a 
maximum during a heat ramp. 
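
The threshold definition above can be sketched with a discrete second difference. This is a minimal illustration under stated assumptions rather than the authors' implementation: the function name, the uniform sampling interval `dt` and the example trace are hypothetical, and no smoothing is applied, although real traces would probably need filtering before differentiation.

```python
def thermal_threshold(temps, calcium, dt=1.0):
    """Temperature at which the second time derivative of the calcium
    signal is maximal during a heat ramp.

    temps and calcium are equal-length sequences sampled every dt seconds.
    """
    # Central second difference: (c[i-1] - 2*c[i] + c[i+1]) / dt^2
    second_derivative = [
        (calcium[i - 1] - 2 * calcium[i] + calcium[i + 1]) / dt ** 2
        for i in range(1, len(calcium) - 1)
    ]
    i_max = max(range(len(second_derivative)), key=second_derivative.__getitem__)
    # second_derivative[k] corresponds to sample k + 1 of the original trace
    return temps[i_max + 1]


# Hypothetical ramp: flat calcium until 44 °C, then a rising response
temps = [40, 41, 42, 43, 44, 45, 46, 47, 48]
calcium = [100, 100, 100, 100, 100, 140, 170, 195, 215]
print(thermal_threshold(temps, calcium))
```

The maximum of the second derivative marks the point of sharpest upward inflection, that is, where the calcium signal begins to accelerate away from baseline.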


Reporting summary 


Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


The data supporting the findings of this study are available within the 